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This is an express request to begin national examination procedures (35 U.S.C 371(0) at any time rather than delay 
exammation until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and 39(1). 

A proper Demand for International Preliminary Examination was made by the 1 9th month from the earliest claimed priority date. 

A copy of the International Application as filed (35 U.S.C. 371 (c) (2)) 

a. □ is transmitted herewith (required only if not transmitted by the International Bureau). 

b. K has been transmitted by the International Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 
A translation of the International Application into English (35 U.S.C. 371(c)(2)). 
A copy of the International Search Report (PCT/ISA/21 0). 

Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371 (c)(3)) 

a. □ are transmitted herewith (required only if not transmitted by the International Bureau). 

b. □ have been transmitted by the International Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d. □ have not been made and will not be made. 

A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 

An oath or declaration of the inventor(s) (35 U.S.C 371 (c)(4)). 

A copy of the International Preliminary Examination Report (PCT/IPE A/409). 

A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 
(35 U.S.C. 371 (c)(5)). 

13 to 20 below concern document(s) or information included: 

An Information Disclosure Statement under 37 CFR 1.97 and 1 .98. 

An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.3 1 is included. 
A FIRST preliminary amendment. 

A SECOND or SUBSEQUENT preliminary amendment. 
A substitute specification. 

A change of power of attorney and/or address letter- 
Certificate of Mailing by Express Mail 
Other items or information: 
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-Disk containing Sequence Listing in Word Format 
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■Return Receipt Postcard 



Page 1 of 2 



PCTUS1/REV03 



U.S. APPLICATION 



INTERNATIONAL APPLICATION NO. 
PCT/EPOO/01903 



ATTORNEY'S DOCKET NUMBER 
Kreisler 1098-KGB 



21. The following fees are submitted:. 

BASIC NATIONAL FEE ( 37 CFR 1.492 (a) (1) - (5)) : 

□ Neither international preliminary examination fee (37 CFR 1 .482) nor 
mternational search fee (37 CFR 1 .445(a)(2) paid to USPTO 

and International Search Report not prepared by the EPO or JPO $1,000.00 

^ International preliminary examination fee (37 CFR 1 .482) not paid to 
USPTO but Internation Search Report prepared by the EPO or JPO 
International preliminary examination fee (37 CFR 1.482) not paid to USPTO 

but international search fee (37 CFR 1.445(a)(2)) paid to USPTO 

International preliminary examination fee paid to USPTO (37 CFR 1 .482) 
but all claims did not satisfy provisions of PCT Article 33(I)-(4) 



□ 
□ 



□ 



$860.00 
$710.00 
$690.00 



International preliminary examination fee paid to USPTO (37 CFR 1.482) 
and all claims satisfied provisions of PCT Article 33(1 )-(4) 



$100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS PTO USE ONLY 



Surcharge of $130.00 for furnishing the oath or declaration later than 
months from the earliest claimed priority date (37 CFR 1.492 (e)). 



□ 20 



□ 30 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



$860.00 



$0.00 



Total claims 



30 



20 = 



10 



$18.00 



$180.00 



Independent claims 



3 = 



$80.00 



$0.00 



Multiple Dependent Claims (check if applicable). 



□ 



$0.00 



TOTAL OF ABOVE CALCULATIONS = 



Reduction of 1/2 for filing by small entity, if applicable. Verified Small Entity Statement 
must also be filed (Note 37 CFR 1.9, 1.27, 1 .28) (check if applicable). 



$1,040.00 



□ 



$0.00 



SUBTOTAL = 



$1,040.00 



Processing fee of $130.00 for furnishmg the English translation later than 
months from the earliest claimed priority date (37 CFR 1 .492 (f)). 



□ 20 



□ 30 



$0.00 



TOTAL NATIONAL FEE 



Fee for recording the enclosed assignment (37 CFR 1 .21 (h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.3 1) (check if applicable). 



$1,040.00 



□ 



$0.00 



TOTAL FEES ENCLOSED 



$1,040.00 



Amount to be: 
refunded 



charged 



□ A check in the amount of 

(3 Please charge my Deposit Account No. 

A duplicate copy of this sheet is enclosed. 



to cover the above fees is enclosed. 



14-1263 in the amount of $1,040.00 



to cover the above fees. 



The Commissioner is hereby authorized to charge any fees which may be required, or credit any overpayme/t 
to Deposit Account No. 14-1263 A duplicate copy of this sheet is enclosed. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a^petkion to r^viVe (37 CFR 
37(a) or (b)) must be fded and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: 



IVorris McLaughlin & Marcus 
220 East 42nd Street 
30th Floor 

New York, New York 10017 



REGISTRATION NUMBER 



DATE 
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U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE 
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ATTORNEY'S DOC 

Kreisler 1098-Ki 



U.S. APPLICATION NO. 

09/9 



37CFR 



INTERNATIONAL APPLICATION NO. 
PCT/EPOO/0 1 903 



INTERNATIONAL FILING DATE 

3 March 2000 (03.03.00) 



PRIORITY DATE CLAIMED 

6 March 1999 (06.03.99) 



TITLE OF INVENTION 

STABLE RECOMBINANT INFLUENZA VIRUSES FREE OF HELPER VIRUSES 



APPLICANT(S) FOR DO/EO/US 

GERD HOBOM, RAMON FLICK, ANETTE MENKE and MAYSA AZZEH 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 



1. 
2. 
3. 

4. 
5. 



6. 
7. 
8. 



□ 



□ 
□ 



9. 
10. 
11. 
12. 



n 
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This is a FIRST submission of items concerning a filing under 35 U.S.C, 371. 

This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 37L 

This is an express request to begin national examination procedures (35 U.S.C. 371 (f)) at any time rather than delay 
examination until the expiration of the applicable time limit set in 35 US.C. 371(b) and PCT Articles 22 and 39(1). 

A proper Demand for International Preliminary Examination was made by the 1 9th month from the earliest claimed priority date. 

A copy of the International Application as filed (35 U.S.C. 371 (c) (2)) 

a. □ is transmitted herewith (required only if not transmitted by the International Bureau). 

b. K has been transmitted by the International Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 
A translation of the Intemational Application into English (35 U.S.C. 371(c)(2)). 

A copy of the Intemational Search Report (PCT/ISA/2I0). 

Amendments to the claims of the Intemational Application under PCT Article 19 (35 U.S.C. 371 (c)(3)) 

a. □ are transmitted herewith (required only if not transmitted by the Intemational Bureau). 

b. □ have been transmitted by the Intemational Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d. □ have not been made and will not be made. 

A translation of the amendments to the claims under PCT Article 19 (35 US.C. 371(c)(3)). 

An oath or declaradon of the inventor(s) (35 U.S.C. 371 (c)(4)). 

A copy of the Intemational Preliminary Examination Report (PCT/IPEA/409). 

A translation of the annexes to the Intemational Preliminary Examination Report under PCT Article 36 
(35 US.C. 371 (c)(5)). 



Items 13 to 20 beiow concern documeiit(s) or information included: 



13. 
14. 
15. 
16- 
17. 
18. 
19- 
20. 



□ An Information Disclosure Statement under 37 CFR 1.97 and L98. 

□ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included. 
K A FIRST preliminary amendment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

□ A substitute specification. 

□ A change of power of attorney and/or address letter, 

□ Certificate of Mailing by Express Mail 
K Other items or information: 



Statement Regarding Sequence Listing 
Disk containing Sequence Listing in Word Format 
-Express Mail Certificate 
Return Receipt Postcard 
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Express Mail Label No. EL 867734761 US 

Date Mailed: August 31. 2001 Kreisler 1 098-KGB 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

APPLICANTS : GERD HOBOM, RAMON FLICK, ANETTE MENKE and MAYSA 
AZZEH 

SERIAL NO. : TO BE ASSIGNED (International Application No.: PCT/EPOO/01 903) 
FILED HEREWITH 

FOR : STABLE RECOMBINANT INFLUENZA VIRUSES FREE OF 

HELPER VIRUSES 

ART UNIT : TO BE ASSIGNED 

EXAMINER : TO BE ASSIGNED 



August 31, 2001 



Hon. Commissioner of Patents 
Washington, D.C. 20231 



PRELIMINARY AMENDMENT 

SIR: 

Prior to examination, please amend the above-identified application as follows: 



IN THE SPECIFICATION : 

Insert the following new paragraph as the first paragraph of the application: — This application 
is a 371 of PCT/EPOO/01 903 filed on March 3, 2000. - - 



IN THE CLAIMS : 

Please amend claims 3, 5, 10-23 and 25-30 as follows: 
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GERD HOBOM ET AL 
INTERNATIONAL APPL, NO.: PCT/EPOO/01903 

3. The recombinant influenza virus according to claim 1, wherein one or more of the regular 
viral RNA segments, differing from said at least one ambisense RNA segment, comprises a 
vRNA encoding a foreign gene, preferably one or more of the regular viral RNA segments has 
(have) been exchanged for a vRNA encoding a foreign gene. 

5. The recombinant influenza virus according to claim 1, in which the terminal viral RNA 
sequences of one or more of the regular segments and/or of the at least one ambisense RNA 
segment, which are active as the promoter signal, have been modified by nucleotide substitutions 
in up to five positions, resulting in improved transcription rates of both the vRNA promoter as 
well as the cRNA promoter as present in the complementary sequence. 

10. The recombinant influenza virus of claim 6, wherein the 5' terminal nucleotide sequence 
comprises the modifications U3A and A8U resulting in a 5'-terminal sequence of 5 - 
AGAAGAAUCAAGG. 

11. The recombinant influenza virus according to claim 1, which is a recombinant influenza 
A virus. 

12. The recombinant influenza virus according to claim 1, in which the foreign gene(s) in 
ambisense covalent junction with viral gene(s) code for proteins and/or glycoproteins which are 
secreted from cells infected with the recombinant virus. 

2 
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GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EPOO/01903 

13. The recombinant virus according to claim 1, in which the foreign gene(s) in ambisense 
covalent junction with viral gene(s) code for proteins or artificial polypeptides designed to 
support an efficient presentation of inherent epitopes at the surface of infected cells, for 
stimulation of a B cell and/or T cell response. 

14. A method for the production of recombinant influenza viruses as defined in claim 1 
comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, in ambisense design, 

(b) followed by infection with an influenza carrier strain constructed to include flanking 
ribozyme target sequences in at least one of its viral RNA segments which is (are) to be replaced 
by the ambisense segments of step (a), and 

(c) thereafter selective vRNA inactivation through ribozyme cleavage. 

15. A pharmaceutical composition comprising a recombinant influenza virus according to 
claim 1. 

16. Method of using a medicament comprising a recombinant influenza virus according to 
claim 1 for vaccination purposes. 

17. The method according to claim 16, wherein the medicament 
(a) is suitable against influenza and/or against other infections; 

3 
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GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EPOO/01903 

(b) is present in form of inactivated preparations; and/or 

(c) is present in form of live recombinant viruses. 

18. Method of using an agent comprising a recombinant influenza virus according to claim 1 
for somatic gene therapy. 

19. Method of using an agent comprising a recombinant influenza virus according to claim 1 
for transfer and expression of foreign genes into cells infected by such viruses. 

20. Method of using an agent comprising a recombinant influenza virus according to claim 1 
for transfer and expression of RNA molecules into cells infected by such viruses. 

21 . The method of claim 20, wherein the RNA molecules to be expressed are antisense 
sequences or double-strand sequences relative to the target cell cellular mRNA molecules, and/or 
the agent is suitable for sequence-specific gene silencing, preferably by antisense RNA or RNA 
interference mechanisms. 

22. The method according to claim 18, w^herein the agent is appUcable in ex vivo and in vivo 
application schemes. 

23. A method for the production of proteins or glycoproteins which comprises utilizing a 
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GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EPOO/0 1 903 

recombinant influenza virus according to claim 1 as expression vector. 

25. A method for preventing and/or treating influenza which comprises administering an 
effective amount of a recombinant influenza virus according to claim 1 to the mammal to be 
treated. 

26. A method for somatic gene therapy, which method comprises subjecting the organism to 
be treated with a recombinant influenza virus according to claim 1 . 

27. A method for transfer and expression of foreign genes into cells, and for transfer and 
expression of RNA molecules into cells, which method comprises infecting the cells with a 
recombinant influenza virus according to claim 1 . 

28. Method of using an agent comprising a recombinant influenza virus according to claim 1 
for autologous immunotherapy. 

29. A method for an immunotherapy which comprises ex vivo infection of immune cells with 
a recombinant influenza virus according to claim 1, and introduction of the transduced cells into 
the patient. 

30. A method for the induction of antibodies which comprises utilizing a recombinant 
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GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EPOO/0 1 903 



influenza virus according to claim 1 as an immunogen. 



Applicants intend the examination to begin based on the amended claims filed May 21, 



The amendments above remove multiple dependencies to reduce the filing cost, and 
convert the "use" claims to the conventional "Method of use" claims. 

Early and favorable action is earnestly solicited. 



KGB:ja 

220 East 42"<' Street 
30"^ Floor 

New York, New York 10017 
(212) 808-0700 



REMARKS 



2001. 
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GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EPOO/01903 

MARK-UP SHOWING THE CHANGES MADE IN THE PREVIOUS CLAIM TO YIELD 
THE CLAIM AS AMENDED ABOVE 

3. The recombinant influenza virus according to [claims 1 and 2] claim 1 . wherein one or 
more of the regular viral RNA segments, differing from said at least one ambisense RNA 
segment, comprises a vRNA encoding a foreign gene, preferably one or more of the regular viral 
RNA segments has (have) been exchanged for a vRNA encoding a foreign gene. 

5. The recombinant influenza virus according to [claims 1 to 4] claim 1 , in which the 
terminal viral RNA sequences of one or more of the regular segments and/or of the at least one 
ambisense RNA segment, which are active as the promoter signal, have been modified by 
nucleotide substitutions in up to five positions, resulting in improved transcription rates of both 
the vRNA promoter as well as the cRNA promoter as present in the complementary sequence. 

10. The recombinant influenza virus of [claims 6 to 9] claim 6 . wherein the 5* terminal 
nucleotide sequence comprises the modifications U3A and A8U resulting in a 5'-terminal 
sequence of 5*-AGAAGAAUCAAGG. 

1 1 . The recombinant influenza virus according to [claims 1 to 10] claim 1 , which is a 
recombinant influenza A virus. 

12. The recombinant influenza virus according to [claims 1 to 1 1] claim L in which the 
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GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EPOO/01903 

foreign gene(s) in ambisense covalent junction with viral gene(s) code for proteins and/or 
glycoproteins which are secreted from cells infected with the recombinant virus. 

13. The recombinant virus according to [claims 1 to 1 1 claim 1 , in which the foreign gene(s) 
in ambisense covalent junction with viral gene(s) code for proteins or artificial polypeptides 
designed to support an efficient presentation of inherent epitopes at the surface of infected cells, 
for stimulation of a B cell and/or T cell response. 

14. A method for the production of recombinant influenza viruses as defined in [claims 1 to 
13] claim 1 comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, in ambisense design, 

(b) followed by infection with an influenza carrier strain constructed to include flanking 
ribozyme target sequences in at least one of its viral RNA segments which is (are) to be replaced 
by the ambisense segments of step (a), and 

(c) thereafter selective vRNA inactivation through ribozyme cleavage. 

15. A pharmaceutical composition comprising a recombinant influenza virus according to 
[claims 1 to 13] claim 1 . 



16. [Use of] Method of using a medicament comprising a recombinant influenza virus 
according to [claims 1 to 13 for preparing a medicament] for vaccinafion purposes. 



GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EPOO/01903 

17. The [use] method according to claim 16, wherein the medicament 

(a) is suitable against influenza and/or against other infections; 

(b) is present in form of inactivated preparations; and/or 

(c) is present in form of live recombinant viruses. 

18. [Use of] Method of using an agent comprising a recombinant influenza virus according 
to [claims 1 to 13 for preparing agents] claim 1 for somatic gene therapy. 

19. [Use of| Method of using an agent comprising a recombinant influenza virus according 
to [claims 1 to 13 for preparing agents,] claim 1 for transfer and expression of foreign genes into 
cells infected by such viruses. 

20. [Use of| Method of using an agent comprising a recombinant influenza virus according 
to [claims 1 to 13 for preparing agents] claim 1 for transfer and expression of RNA molecules 
into cells infected by such viruses. 

21. The [use] method of claim 20, wherein the RNA molecules to be expressed are antisense 
sequences or double-strand sequences relative to the target cell cellular mRNA molecules, and/or 
the agent is suitable for sequence-specific gene silencing, preferably by antisense RNA or RNA 
interference mechanisms. 
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GERD HOBOM ET AL 
INTERNATIONAL APPL. NO.: PCT/EP 00/0 1903 

22. The [use] method according to [claims 18 to 21 claim 18 , wherein the agent[s are] is 
appHcable in ex vivo and in vivo application schemes. 

23. A method for the production of proteins or glycoproteins which comprises utilizing a 
recombinant influenza virus according to [claims 1 to 13] claim 1 as expression vector. 

25. A method for preventing and/or treating influenza which comprises administering an 
effective amount of a recombinant influenza virus according to [claims 1 to 13] claim 1 to the 
mammal to be treated. 

26. A method for somatic gene therapy, which method comprises subjecting the organism to 
be treated with a recombinant influenza virus according to [claims 1 to 13] claim 1 . 

27. A method for transfer and expression of foreign genes into cells, and for transfer and 
expression of RNA molecules into cells, which method comprises infecting the cells with a 
recombinant influenza virus according to [claims 1 to 13] claim 1 . 

28. [Use of| Method of using an agent comprising a recombinant influenza virus according 
to [claims 1 to 13 for preparing agents] claim 1 for autologous immunotherapy. 

29. A method for an immunotherapy which comprises ex vivo infection of immune cells with 
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GERD HOBOM ET AL 
INTERNATFONAL APPL. NO.: PCT/EPOO/0 1 903 

a recombinant influenza virus according to [claims 1 to 13] claim 1 , and introduction of the 
transduced cells into the patient. 

30. A method for the induction of antibodies which comprises utilizing a recombinant 
influenza virus according to [claims 1 to 13] claim 1 as an immunogen. 
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Attorney Docket No.: Kreisler 1098-KGB 
"Express Mail" Label No. EL 867734861 US Client Ref. No.: 0 1 2 1 42us/JH/ml 

Date of Deposit: _ 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



APPLICANTS 

SERIAL NO. 

FILED 

FOR 

ART UNIT 
EXAMINER 



GERD HOBOM, RAMON FLICK, ANETTE MENKE and MAYSA 
AZZEH 

TO BE ASSIGNED (International Application No.: PCT/EPOO/01903) 
HEREWITH 

STABLE RECOMBINANT INFLUENZA VIRUSES FREE OF 
HELPER VIRUSES 

TO BE ASSIGNED 

TO BE ASSIGNED 



August 31, 2001 



Hon. Commissioner of Patents 
Washington, D.C. 20231 



STATEMENT REGARDING THE SEQUENCE LISTING 



SIR: 



Attached is a computer diskette containing a computer readable form of the paper 
Sequence Listing embedded in the specification. The undersigned hereby certifies that the 
paper Sequence Listing does not introduce new matter, and that the content of the paper 
Sequence Listing and the computer readable form copy is the same and includes no new 
matter. 



Early and favorable action is earnestly solicited. 



220 East 42"" Street - 30"^ Floor 
New York, New York 10017 
Phone: (212) 808-0700 
Fax: (212) 808-0700 



Respectfully submitted, 

NORRIS N^tlAUGlCSq & MARCUS, P. A. 





09/914658 
Wf/PfO u 2002 



-7048421 
SEQUENCE LISTING 



<110> Hobom, Gerd 

Flick, Ramon 
Menke, Anette 
Azzeh , Maysa 



<120> Stable Recombinant Influenza Viruses Free of Helper 
vi ruses 

<130> Kreisler 1098 

<140> US 09/914,658 
<141> 2001-08-31 

<140> PCT/EPOO/01903 
<141> 2000-03-03 

<150> EP 99104519 
<151> 1999-03-06 

<160> 26 

<170> Microsoft word XP 

<210> 1 
<211> 4930 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pHL2969 
<400> 1 

cgtacgaagc ttctagaggg attggctgag acgaaaaaca tatgctagag ggattggctg 60 
agacgaaaaa catatgctag agcggccgcc accgcggtgg agctccagct tttgttccct 120 
ttagtgaggg ttaattgcgc gcaggcctag ctaggtaaag aaaaataccc ttgtttctac 180 
taataacccg gcggcccaaa atgccgactc ggagcgaaag atatacctcc cccggggccg 240 
ggaggtcgcg tcaccgacca cgccgccggc ccaggcgacg cgcgacacgg acacctgtcc 300 
ccaaaaacgc caccatcgca gccacacacg gagcgcccgg ggccctctgg tcaaccccag 360 
gacacacgcg ggagcagcgc cgggccgggg acgccctccc ggccgcccgt gccacacgca 420 
gg99gccggc ccgtgtctcc agagcgggag ccggaagcat tttcggccgg cccctcctac 480 
gaccgggaca cacgagggac cgaaggccgg ccaggcgcga cctctcgggc cgcacgcgcg 540 
ctcagggagc gctctccgac tccgcacggg gactcgccag aaaggatcgt gacctgcatt 600 
aatgaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 660 
aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 720 
cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 780 
gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 840 
tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 900 
tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 960 
cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 1020 
gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 1080 
ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt 1140 
ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 1200 
ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 1260 
agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 1320 
aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 1380 
atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1440 
tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 1500 
tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 1560 
tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 1620 
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gcaataaacc agccagccgg aagggccgag 
tccatccagt ctattaattg ttgccgggaa 
ttgcgcaacg ttgttgccat tgctacaggc 
gcttcattca gctccggttc ccaacgatca 
aaaaaagcgg ttagctcctt cggtcctccg 
ttatcactca tggttatggc agcactgcat 
tgcttttctg tgactggtga gtactcaacc 
ccgagttgct cttgcccggc gtcaacacgg 
aaagtgctca tcattggaaa acgttcttcg 
ttgagatcca gttcgatgta acccactcgt 
ttcaccagcg tttctgggtg agcaaaaaca 
agggcgacac ggaaatgttg aatactcata 
tatcagggtt attgtctcat gagcggatac 
aagagtttgt agaaacgcaa aaaggccatc 
cctggcagtt tatggcgggc gtcctgcccg 
caaatccgct cccggcggat ttgtcctact 
aaaacgaaag gcccagtctt tcgactgagc 
ctactctcgc atggggagac cccacactac 
tcggcatggg gtcaggtggg accaccgcgc 
gaccgcttct gcgttctgat ttaatctgta 
aacagaagct agcggccgat ccccaaaaaa 
tggccccgcc gttccgcgcc gggggggggg 
gacctccagc atcgggggaa aaaaaaaaaa 
ccgaagttgg gggggagtag aaacagggta 
ctgactgaaa tgccttgagc aactgactga 
agataatcac tcactgagtg acatccacat 
attgtaccag ggattggctg agacgaaaaa 
atcctggttt tcgcccttgc ggcagtcatc 
catcatgctg tatcaaatgg caccaaagta 
gtcaatgcaa cggaaacagt ggagcggaca 
agaaccactg atcttggcca atgcggactg 
gaccaatttc tagaattttc agctgatcta 
tgttacccgg ggaagtttgt taatgaagag 
gggattgaca aagaaacaat gggattcaca 
agtgcatgta gaagatcagg gtcttcattc 
acagacaatg cttctttccc acaaatgaca 
gctctgatag tctggggaat ccaccattca 
gggagtggaa ataaactgat aacagtcggg 
agtccaggaa cacgaccgca gataaatggc 
atcttggatc ccaatgatac agttactttt 
cgtgccagct tcttgagggg aaagtccatg 
aattgcgaag gggaatgcta ccacagtgga 
aacataaata gcagagcagt tggcaaatgc 
ttggcaactg ggatgaagaa cgttcccgaa 
tttggtgcta tagcagggtt tattgaaaat 
ggtttcaggc atcagaatgc acaaggagaa 
tcggcaattg atcagataac cggaaagtta 
tttgagctaa tagataatga attcactgaa 
tggaccaaag actccatcac agaagtatgg 
gaaaaccagc acactattga tttggctgat 
aggaaacaat taagggaaaa tgctgaagag 
aaatgtgacg atgattgtat ggctagtata 
agagaagaag cgatgcaaaa tagaatacaa 
aaagatgtga tactttggtt tagcttcggg 
atgggccttg ttttcatatg tgtgaagaac 
agcttgcatg 



<210> 2 
<211> 5811 
<212> DNA 

<213> Artificial Sequence 



--7048421 

cgcagaagtg gtcctgcaac tttatccgcc 1680 
gctagagtaa gtagttcgcc agttaatagt 1740 
atcgtggtgt cacgctcgtc gtttggtatg 1800 
aggcgagtta catgatcccc catgttgtgc 1860 
atcgttgtca gaagtaagtt ggccgcagtg 1920 
aattctctta ctgtcatgcc atccgtaaga 1980 
aagtcattct gagaatagtg tatgcggcga 2040 
gataataccg cgccacatag cagaacttta 2100 
gggcgaaaac tctcaaggat cttaccgctg 2160 
gcacccaact gatcttcagc atcttttact 2220 
ggaaggcaaa atgccgcaaa aaagggaata 2280 
ctcttccttt ttcaatatta ttgaagcatt 2340 
atatttgaat gtatttagaa aaataaacaa 2400 
cgtcaggatg gccttctgct taatttgatg 2460 
ccaccctccg ggccgttgct tcgcaacgtt 2520 
caggagagcg ttcaccgaca aacaacagat 2580 
ctttcgtttt atttgatgcc tggcagttcc 2640 
catcggcgct acggcgtttc acttctgagt 2700 
tactgccgcc aggcaaattc tgttttatca 2760 
tcaggctgaa aatcttctct catccgccaa 2820 
aaaaaaaaaa aaaaaaaaaa gagtccagag 2880 
gggggggggg acactttcgg acatctggtc 2940 
caaagtttcg cccggagtac tggtcgacct 3000 
gataatcact cactgacgta cgttgagcaa 3060 
aatgcctgac gtctttagca aaagcagggt 3120 
cgtaccagga ttggctgaga cgaaaaacat 3180 
catattgtag gtaccaaaat gaacactcaa 3240 
cccacaaatg cagacaaaat ttgtcttgga 3300 
aacacactca ctgagagagg agtagaagtt 3360 
aacatcccca aaatttgctc aaaagggaaa 3420 
ttagggacca ttaccggacc acctcaatgc 3480 
ataatcgaga gacgagaagg aaatgatgtt 3540 
gcattgcgac aaatcctcag aggatcaggt 3600 
tatagtggaa taaggaccaa cggaacaact 3660 
tatgcagaaa tggagtggct cctgtcaaat 3720 
aaatcataca aaaacacagg gagagaatca 3780 
ggatcaacca ccgaacagac caaactatat 3840 
agttccaaat atcatcaatc ttttgtgccg 3900 
cggtccggac ggattgattt tcattggttg 3960 
agtttcaatg gggctttcat agctccaaat 4020 
gggatccaga gcgatgtgca ggttgatgct 4080 
gggactataa caagcagatt gccttttcaa 4140 
ccaagatatg taaaacagga aagtttatta 4200 
ccttccaaaa aaaggaaaaa aagaggcctg 4260 
ggttgggaag gtctggtcga cgggtggtac 4320 
ggaactgcag cagactacaa aagcacccaa 4380 
aatagactca ttaagaaaac caaccagcaa 4440 
gtggaaaagc agattggcaa tttaattaac 4500 
tcttacaatg ctgaacttct tgtggcaatg 4560 
tcagagatga acaagctgta tgagcgagtg 4620 
gatggcactg gttgctttga aatttttcat 4680 
aggaacaata cttatgatca cagcaaatac 4740 
attgacccag tcaaattgag tagtggctac 4800 
gcatcatgct ttttgcttct tgccattgca 4860 
ggaaacatgc ggtgcactat atgcatttaa 4920 

4930 
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<220> 



<223> Description of Artificial Sequence: pAM403 
<400> 2 

aattcctttg cctaatttaa atgaggactt aacctgtgga aatattttga tgtgggaagc 60 
tgttactgtt aaaactgagg ttattggggt aactgctatg ttaaacttgc attcagggac 120 
acaaaaaact catgaaaatg gtgctggaaa acccattcaa gggtcaaatt ttcatttttt 180 
tgctgttggt ggggaacctt tggagctgca gggtgtgtta gcaaactaca ggaccaaata 240 
tcctgctcaa actgtaaccc caaaaaatgc tacagttgac agtcagcaga tgaacactqa 300 
ccacaaggct gttttggata aggataatgc ttatccagtg gagtgctggg ttcctqatcc 360 
aagtaaaaat gaaaacacta gatattttgg aacctacaca ggtggggaaa atgtgcctcc 420 
tgttttgcac attactaaca cagcaaccac agtgcttctt gatgagcagg gtgttqgqcc 480 
cttgtgcaaa gctgacagct tgtatgtttc tgctgttgac atttgtgggc tgtttaccaa 540 
cacttctgga acacagcagt ggaagggact tcccagatat tttaaaatta cccttagaaa 600 
gcggtctgtg aaaaacccct acccaatttc ctttttgtta agtgacctaa ttaacaqqaq 660 
gacacagagg gtggatgggc agcctatgat tggaatgtcc tctcaagtag aggagqttaq 720 
ggtttatgag gacacagagg agcttcctgg ggatccagac atgataagat acattgatga 780 
gtttggacaa accacaacta gaatgcagtg aaaaaaatgc tttatttgtg aaatttqtqa 840 
tgctattgct ttatttgtaa ccattataag ctgcaataaa caagttaaca acaacaattq 900 
cattcatttt atgtttcagg ttcaggggga ggtgtgggag gttttttaaa gcaagtaaaa 960 
"^JJ^^^^^ tgtggtatgg ctgattatga tctctagtca aggcactata catcaaatat 1020 
tccttattaa cccctttaca aattaaaaag ctaaaggtac acaatttttg agcatagtta 1080 
ttaatagcag acactctatg cctgtgtgga gtaagaaaaa acagtatgtt atgattataa 1140 
ctgttatgcc tacttataaa ggttacagaa tatttttcca taattttctt gtatagcaqt 1200 
gcagcttttt cctttgtggt gtaaatagca aagcaagcaa gagttctatt actaaacaca 1260 
aaaaacttag caattctgaa ggaaagtcct tggggtcttc tacctttctc 1320 
gaggagtaga atgttgagag tcagcagtag cctcatcatc actaqatqqc 1380 
atttcttctg agcaaaacag gttttcctca ttaaaggcat tccaccactg ctcccattca 1440 
tcagttccat aggttggaat ctaaaataca caaacaatta gaatcagtag tttaacacat 1500 
tatacactta aaaattttat atttacctta gagctttaaa tctctgtagg tagtttgtcc 1560 
aattatgtca caccacagaa gtaaggttcc ttcacaaaga tccgggacca aagcggccat 1620 
cgtgcctccc cactcctgca gttcgggggc atggatgcgc ggatagccgc tgctggtttc 1680 
ctggatgccg acggatttgc actgccggta gaactccgcg aggtcgtcca gcctcaggca 1740 
gcagctgaac caactcgcga ggggatcgag cccggggtgg gcgaagaact ccagcatgag 1800 
atccccgcgc tggaggatca tccagccggc gtcccggaaa acgattccga agcccaacct 1860 
ttcatagaag gcggcggtgg aatcgaaatc tcgtgatggc aggttgggcg tcgcttgqtc 1920 
ggtcatttcg atgaattcga gctcggtacc cggggatcct ctagaggcat ttcagtttcg 1980 
tcctcacgga ctcatcagag ttgctcaatt cgaaccccag agtcccgctc agaagaactc 2040 
gtcaagaagg cgatagaagg cgatgcgctg cgaatcggga gcggcgatac cgtaaagcac 2100 
gaggaagcgg tcagcccatt cgccgccaag ctcttcagca atatcacggg tagccaacgc 2160 
tagcggtccg ccacacccag ccggccacag tcgatgaatc cagaaaagcg 2220 
gccattttcc accatgatat tcggcaagca ggcatcgcca tgggtcacga cgagatcctc 2280 
gccgtcgggc atgcgcgcct tgagcctggc gaacagttcg gctggcgcga gcccctgatg 2340 
ctcttcgtcc agatcatcct gatcgacaag accggcttcc atccgagtac gtgctcgctc 2400 
gatgcgatgt ttcgcttggt ggtcgaatgg gcaggtagcc ggatcaagcg tatgcaqccq 2460 
ccgcattgca tcagccatga tggatacttt ctcggcagga gcaaggtgag atgacaggag 2520 
atcctgcccc ggcacttcgc ccaatagcag ccagtccctt cccgcttcag tgacaacgtc 2580 
gagcacagct gcgcaaggaa cgcccgtcgt ggccagccac gatagccgcg ctgcctcgtc 2640 
ctgcagttca ttcagggcac cggacaggtc ggtcttgaca aaaagaaccg ggcgcccctg 2700 
cgctgacagc cggaacacgg cggcatcaga gcagccgatt gtctgttgtg cccagtcata 2760 
gccgaatagc ctctccaccc aagcggccgg agaacctgcg tgcaatccat cttgttcaat 2820 
catgcgaaac gatcctcatc ctgtctcttg atcagatctt gatcccctgc gccatcagat 2880 
ccttggcggc aagaaagcca tccagtttac tttgcagggc ttcccaacct taccagaggg 2940 
cgccccagct ggcaattccg gttcgcttgc tgtccataaa accgcccagt ctagctatcg 3000 
ccatgtaagc ccactgcaag ctacctgctt tctctttgcg cttgcgtttt cccttgtcca 3060 
gatagcccag tagctgacat tcatccgggg tcagcaccgt ttctgcggac tggctttcta 3120 
cgtgttccgc ttcctttagc agcccttgcg ccctgagtgc ttgcggcagc gtgaagcttt 3180 
ttgcaaaagc ctaggcctcc aaaaaagcct cctcactact tctggaatag ctcagaggcc 3240 
gaggcggcct cggcctctgc ataaataaaa aaaattagtc agccatgggg cggagaatgg 3300 
gcggaactgg gcggagttag gggcgggatg ggcggagtta ggggcgggac tatggttgct 3360 
gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg ggactttcca 3420 
cacctggttg ctgactaatt gagatgcatg ctttgcatac ttctgcctgc tggggagcct 3480 
ggggactttc cacaccctaa ctgacacaca ttccacagct gcctcgcgcg tttcggtgat 3540 
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gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg tctgtaagcg 3600 
gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc 3660 
gcagccatga cccagtcacg tagcgatagc ggagtgtata ctggcttaac tatgcggcat 3720 
cagagcagat tgtactgaga gtgcaccata tgcggtgtga aataccgcac agatgcgtaa 3780 
ggagaaaata ccgcatcagg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg 3840 
tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 3900 
aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 3960 
gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 4020 
aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 4080 
ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 4140 
tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 4200 
tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 4260 
ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 4320 
tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 4380 
ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta 4440 
tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 4500 
aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 4560 
aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 4620 
aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 4680 
ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 4740 
acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 4800 
ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 4860 
gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 4920 
taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 4980 
tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 5040 
gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 5100 
cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 5160 
aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 5220 
cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct 5280 
tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga 5340 
gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga actttaaaag 5400 
tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 5460 
gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 5520 
ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 5580 
cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 5640 
agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 5700 
gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca 5760 
tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa g 5811 



<210> 3 
<211> 2005 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vHM41 
<400> 3 

agtagaaaca agggtatttt tctttaccta gctaggcctg cgcgcaatta accctcacta 60 
aagggaacaa aagctggagc tccaccgcgg tggcggccgc tctagcatat gtttttcgtc 120 
tcagccaatc cctctagcat atgtttttcg tctcagccaa tccctctaga agcttcgtac 180 
gcatgcaagc tttaaatgca tatagtgcac cgcatgtttc cgttcttcac acatatgaaa 240 
acaaggccca ttgcaatggc aagaagcaaa aagcatgatg ccccgaagct aaaccaaagt 300 
atcacatctt tgtagccact actcaatttg actgggtcaa tttgtattct attttgcatc 360 
gcttcttctc tgtatttgct gtgatcataa gtattgttcc ttatactagc catacaatca 420 
tcgtcacatt tatgaaaaat ttcaaagcaa ccagtgccat cctcttcagc attttccctt 480 
aattgtttcc tcactcgctc atacagcttg ttcatctctg aatcagccaa atcaatagtg 540 
tgctggtttt ccattgccac aagaagttca gcattgtaag accatacttc tgtgatggag 600 
tctttggtcc agttaattaa attgccaatc tgcttttcca cttcagtgaa ttcattatct 660 
attagctcaa attgctggtt ggttttctta atgagtctat ttaactttcc ggttatctga 720 
tcaattgccg attgggtgct tttgtagtct gctgcagttc cttctccttg tgcattctga 780 
tgcctgaaac cgtaccaccc gtcgaccaga ccttcccaac cattttcaat aaaccctgct 840 
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atagcaccaa 
ccagttgcca 
ctatttatgt 
ccttcgcaat 
aagctggcac 
ggatccaaga 
gttcctggac 
tttccactcc 
actatcagag 
gcattgtctg 
ctacatgcac 
ttgtcaatcc 
cccgggtaac 
agaaattggt 
tcagtggttc 
gttgcattga 
acagcatgat 
aaaaccagga 
cctggtacaa 
agtgattatc 



-7048421 

acaggcctct ttttttcctt tttttggaag gttcgggaac gttcttcatc 900 
ataataaact ttcctgtttt acatatcttg ggcatttgcc aactgctctg 960 
tttgaaaagg caatctgctt gttatagtcc ctccactgtg gtagcattcc 1020 
tagcatcaac ctgcacatcg ctctggatcc ccatggactt tcccctcaag 1080 
gatttggagc tatgaaagcc ccattgaaac taaaagtaac tgtatcattg 1140 
tcaaccaatg aaaatcaatc cgtccggacc ggccatttat ctgcggtcgt 1200 
tcggcacaaa agattgatga tatttggaac tcccgactgt tatcagttta 1260 
catatagttt ggtctgttcg gtggttgatc ctgaatggtg gattccccag 1320 
ctgattctct ccctgtgttt ttgtatgatt ttgtcatttg tgggaaagaa 1380 
tatttgacag gagccactcc atttctgcat agaatgaaga ccctgatctt 1440 
tagttgttcc gttggtcctt attccactat atgtgaatcc cattgtttct 1500 
cacctgatcc tctgaggatt tgtcgcaatg cctcttcatt aacaaacttc 1560 
aaacatcatt tccttctcgt ctctcgatta ttagatcagc tgaaaattct 1620 
cgcattgagg tggtccggta atggtcccta acagtccgca ttggccaaga 1680 
ttttcccttt tgagcaaatt ttggggatgt ttgtccgctc cactgtttcc 1740 
caacttctac tcctctctca gtgagtgtgt ttactttggt gccatttgat 1800 
gtccaagaca aattttgtct gcatttgtgg ggatgactgc cgcaagggcg 1860 
tttgagtgtt cattttggta cctacaatat gtttttcgtc tcagccaatc 1920 
tatgtttttc gtctcagcca atcctggtac gatgtggatg tcactcaqtg 1980 
taccctgctt ttgct ^ ^ 2005 



<210> 4 
<211> 1146 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vHMSl 

<400> 4 
agtagaaaca 
aagggaacaa 
tcagccaatc 
gcatgcttaa 
ttgtaaggct 
tctcacttct 
ttttccgttt 
cgaatctctg 
tgaggacccc 
gtgaaatttc 
ccagccgatc 
cctggtccat 
agtcccttga 
caatggtcat 
ttccaacacg 
acttctgatc 
atcgtttgcg 
ccattatggt 
cgtctcagcc 
tttgct 



agggtatttt 
aagctggagc 
cctctagcat 
ataagctgaa 
tgcataaatg 
tcaattaacc 
ctgctttgga 
tatagtttca 
aattgcattt 
gccgacaatt 
gaaaatcaca 
tctgatgcaa 
catctcttca 
tttaagtgcc 
agtagctgtg 
tcggcgaagt 
gacatgccaa 
acctacaata 
aatcctggta 



tctttaccta 
tccaccgcgg 
atgtttttcg 
acgagaaagt 
ttatttgttc 
atcttatttc 
gggagtggag 
gagactcgaa 
ttgacatcct 
gctccctcat 
ctgaagtttg 
agggagcctg 
agagtcatgt 
tcatcggatt 
tcgatgtcca 
cggtcaagga 
agaaagcagt 
tgtttttcgt 
cgatgtggat 



gctaggcctg 
tggcggccgc 
tctcagccaa 
tcttatctct 
aaaactattc 
ttcaaatttc 
gtcccccatt 
ctgtgttatc 
catcagtatg 
cggttaaagc 
ctttcagtat 
ccactttctg 
cagttaggta 
cgtcctccag 
gaccaagagt 
atggggcatc 
ctacctgaaa 
ctcagccaat 
gtcactcagt 



cgcgcaatta 
tctagcatat 
tccctctaga 
tgctccactt 
tctgttatct 
tgactcaatt 
ctcattactg 
attccattca 
tcctggaaga 
ccttaatagt 
gatgttcttc 
tttgggcatg 
gcgtgtagca 
aatccgctcc 
gctgcctctt 
acccatttct 
gcttgacaca 
ccctggtaca 
gagtgattat 



accctcacta 
gtttttcgtc 
agcttcgtac 
caagcggtag 
tcaatctatg 
gttctcgcca 
cttctccaag 
agtcctccga 
gaaggcaatg 
atgagagttt 
cccatgatcg 
agcatgaacc 
ggtacagagg 
actatctgct 
cccctcaggg 
tggtctgcaa 
gtgttggaat 
atatgttttt 
ctaccctgct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1146 



<210> 5 
<211> 5860 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pAM424 
<400> 5 

catcgattgg ctgactgatg agtccgtgag gacgaaacga aaaacatatt gtagagctcg 60 
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aattcatcga 
ccgccgcctt 
tcctccagcg 
cgagttggtt 
aatccgtcgg 
aggagtgggg 
tgtggtgtga 
aaatttttaa 
ccaacctatg 
ttttgctcag 
actcctccaa 
agttttttga 
acaaaggaaa 
ataagtaggc 
agagtgtctg 
aaaggggtta 
taccacattt 
gaaacataaa 
caaataaagc 
ttgtggtttg 
ctgtgtcctc 
catccaccct 
ggtttttcac 
gctgtgttcc 
tgtcagcttt 
tagtaatgtg 
tgttttcatt 
ccaaaacagc 
ttacagtttg 
gttccccacc 
tttcatgagt 
cagttttaac 
aattaggcaa 
taatgtcatg 
cggaacccct 
ataaccctga 
ccgtgtcgcc 
aacgctggtg 
actggatctc 
gatgagcact 
agagcaactc 
cacagaaaag 
catgagtgat 
aaccgctttt 
gctgaatgaa 
aacgttgcgc 
agactggatg 
ctggtttatt 
actggggcca 
aactatggat 
gtaactgtca 
atttaaaagg 
tgagttttcg 
tccttttttt 
ggtttgtttg 
agcgcagata 
ctctgtagca 
tggcgataag 
gcggtcgggc 
cgaactgaga 
ggcggacagg 
agggggaaac 
tcgatttttg 



aatgaccgac 
ctatgaaagg 
cggggatctc 
cagctgctgc 
catccaggaa 
aggcacgatg 
cataattgga 
gtgtataatg 
gaactgatga 
aagaaatgcc 
aaaagaagag 
gtcatgctgt 
aagctgcact 
ataacagtta 
ctattaataa 
ataaggaata 
gtagaggttt 
atgaatgcaa 
aatagcatca 
tccaaactca 
ataaacccta 
ctgtgtcctc 
agaccgcttt 
agaagtgttg 
gcacaagggc 
caaaacagga 
tttacttgga 
cttgtggtca 
agcaggatat 
aacagcaaaa 
tttttgtgtc 
agtaacagct 
aggaattctt 
ataataatgg 
atttgtttat 
taaatgcttc 
cttattccct 
aaagtaaaag 
aacagcggta 
tttaaagttc 
ggtcgccgca 
catcttacgg 
aacactgcgg 
ttgcacaaca 
gccataccaa 
aaactattaa 
gaggcggata 
gctgataaat 
gatggtaagc 
gaacgaaata 
gaccaagttt 
atctaggtga 
ttccactgag 
ctgcgcgtaa 
ccggatcaag 
ccaaatactg 
ccgcctacat 
tcgtgtctta 
tgaacggggg 
tacctacagc 
tatccggtaa 
gcctggtatc 
tgatgctcgt 



caagcgacgc 
ttgggcttcg 
atgctggagt 
ctgaggctgg 
accagcagcg 
gccgctttgg 
caaactacct 
tgttaaacta 
atgggagcag 
atctagtgat 
aaaggtagaa 
gtttagtaat 
gctatacaag 
taatcataac 
ctatgctcaa 
tttgatgtat 
tacttgcttt 
ttgttgttgt 
caaatttcac 
tcaatgtatc 
acctcctcta 
ctgttaatta 
ctaagggtaa 
gtaaacagcc 
ccaacaccct 
ggcacatttt 
tcaggaaccc 
gtgttcatct 
ttggtcctgt 
aaatgaaaat 
cctgaatgca 
tcccacatca 
gaagacgaaa 
tttcttagac 
ttttctaaat 
aataatattg 
tttttgcggc 
atgctgaaga 
agatccttga 
tgctatgtgg 
tacactattc 
atggcatgac 
ccaacttact 
tgggggatca 
acgacgagcg 
ctggcgaact 
aagttgcagg 
ctggagccgg 
cctcccgtat 
gacagatcgc 
actcatatat 
agatcctttt 
cgtcagaccc 
tctgctgctt 
agctaccaac 
tccttctagt 
acctcgctct 
ccgggttgga 
gttcgtgcac 
gtgagctatg 
gcggcagggt 
tttatagtcc 
caggggggcg 



-7048421 
ccaacctgcc atcacgagat 
gaatcgtttt ccgggacgcc 
tcttcgccca ccccgggctc 
acgacctcgc ggagttctac 
gctatccgcg catccatgcc 
tcccggatct ttgtgaagga 
acagagattt aaagctctaa 
ctgattctaa ttgtttgtgt 
tggtggaatg cctttaatga 
gatgaggcta ctgctgactc 
gaccccaagg actttccttc 
agaactcttg cttgctttgc 
aaaattatgg aaaaatattc 
atactgtttt ttcttactcc 
aaattgtgta cctttagctt 
agtgccttga ctagagatca 
aaaaaacctc ccacacctcc 
taacttgttt attgcagctt 
aaataaagca tttttttcac 
ttatcatgtc tggatcccca 
cttgagagga cattccaatc 
ggtcacttaa caaaaaggaa 
ttttaaaata tctgggaagt 
cacaaatgtc aacagcagaa 
gctcatcaag aagcactgtg 
ccccacctgt gtaggttcca 
agcactccac tggataagca 
gctgactgtc aactgtagca 
agtttgctaa cacaccctgc 
ttgacccttg aatgggtttt 
agtttaacat agcagttacc 



aaatatttcc acaggttaag 
gggcctcgtg atacgcctat 
gtcaggtggc acttttcggg 
acattcaaat atgtatccgc 
aaaaaggaag agtatgagta 
attttgcctt cctgtttttg 
tcagttgggt gcacgagtgg 
gagttttcgc cccgaagaac 
cgcggtatta tcccgtgttg 
tcagaatgac ttggttgagt 



agtaagagaa ttatgcagtg 
tctgacaacg atcggaggac 
tgtaactcgc cttgatcgtt 
tgacaccacg atgcctgcag 
acttactcta gcttcccggc 
accacttctg cgctcggccc 
tgagcgtggg tctcgcggta 
cgtagttatc tacacgacgg 
tgagataggt gcctcactga 
actttagatt gatttaaaac 
tgataatctc atgaccaaaa 
cgtagaaaag atcaaaggat 
gcaaacaaaa aaaccaccgc 
tctttttccg aaggtaactg 
gtagccgtag ttaggccacc 
gctaatcctg ttaccagtgg 
ctcaagacga tagttaccgg 
acagcccagc ttggagcgaa 
agaaagcgcc acgcttcccg 
cggaacagga gagcgcacga 
tgtcgggttt cgccacctct 
gagcctatgg aaaaacgcca 
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ttcgattcca 
ggctggatga 
gatcccctcg 
cggcagtgca 
cccgaactgc 
accttacttc 
ggtaaatata 
attttagatt 
ggaaaacctg 
tcaacattct 
agaattgcta 
tatttacacc 
tgtaaccttt 
acacaggcat 
tttaatttgt 
taatcagcca 
ccctgaacct 
ataatggtta 
tgcattctag 
ggaagctcct 
ataggctgcc 
attgggtagg 
cccttccact 
acatacaagc 
gttgctgtgt 
aaatatctag 
ttatccttat 
ttttttgggg 
agctccaaag 
ccagcaccat 
ccaataacct 
tcctcattta 
ttttataggt 
gaaatgtgcg 
tcatgagaca 
ttcaacattt 
ctcacccaga 
gttacatcga 
gttttccaat 
acgccgggca 
actcaccagt 
ctgccataac 
cgaaggagct 
gggaaccgga 
caatggcaac 
aacaattaat 
ttccggctgg 
tcattgcagc 
ggagtcaggc 
ttaagcattg 
ttcattttta 
tcccttaacg 
cttcttgaga 
taccagcggt 
gcttcagcag 
acttcaagaa 
ctgctgccag 
ataaggcgca 
cgacctacac 
aagggagaaa 
gggagcttcc 
gacttgagcg 
gcaacgcggc 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
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ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 3900 

ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 3960 

ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc tgatgcggta 4020 

ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc tcagtacaat 4080 

ctgctctgat gccgcatagt taagccagta tacactccgc tatcgctacg tgactgggtc 4140 

atggctgcgc cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc 4200 

ccggcatccg cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt 4260 

tcaccgtcat caccgaaacg cgcgaggcag ctgtggaatg tgtgtcagtt agggtgtgga 4320 

aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa ttagtcagca 4380 

accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag catgcatctc 4440 

aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct aactccgccc 4500 

agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 4560 

gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggc 4620 

ttttgcaaaa agcttcacgc tgccgcaagc actcagggcg caagggctgc taaaggaagc 4680 

ggaacacgta gaaagccagt ccgcagaaac ggtgctgacc ccggatgaat gtcagctact 4740 

gggctatctg gacaagggaa aacgcaagcg caaagagaaa gcaggtagct tgcagtgggc 4800 

ttacatggcg atagctagac tgggcggttt tatggacagc aagcgaaccg gaattgccag 4860 

ctggggcgcc ctctggtaag gttgggaagc cctgcaaagt aaactggatg gctttcttgc 4920 

cgccaaggat ctgatggcgc aggggatcaa gatctgatca agagacagga tgaggatcgt 4980 

ttcgcatgat tgaacaagat ggattgcacg caggttctcc ggccgcttgg gtggagaggc 5040 

tattcggcta tgactgggca caacagacaa tcggctgctc tgatgccgcc gtgttccggc 5100 

tgtcagcgca ggggcgcccg gttctttttg tcaagaccga cctgtccggt gccctgaatg 5160 

aactgcagga cgaggcagcg cggctatcgt ggctggccac gacgggcgtt ccttgcgcag 5220 

ctgtgctcga cgttgtcact gaagcgggaa gggactggct gctattgggc gaagtgccgg 5280 

ggcaggatct cctgtcatct caccttgctc ctgccgagaa agtatccatc atggctgatg 5340 

caatgcggcg gctgcatacg cttgatccgg ctacctgccc attcgaccac caagcgaaac 5400 

atcgcatcga gcgagcacgt actcggatgg aagccggtct tgtcgatcag gatgatctgg 5460 

acgaagagca tcaggggctc gcgccagccg aactgttcgc caggctcaag gcgcgcatgc 5520 

ccgacggcga ggatctcgtc gtgacccatg gcgatgcctg cttgccgaat atcatggtgg 5580 

aaaatggccg cttttctgga ttcatcgact gtggccggct gggtgtggcg gaccgctatc 5640 

aggacatagc gttggctacc cgtgatattg ctgaagagct tggcggcgaa tgggctgacc 5700 

gcttcctcgt gctttacggt atcgccgctc ccgattcgca gcgcatcgcc ttctatcgcc 5760 

ttcttgacga gttcttctga gcgggactct ggggttcgaa tcctaccagg gattggctga 5820 

ctgatgagtc cgtgaggacg aaacgaaaaa catatggtac 5860 



<210> 6 
<211> 4610 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial sequence: pHL2507 



<400> 6 

gaggcatttc 

cttgcggcag 

aatggcacca 

acagtggagc 

ggccaatgcg 

ttttcagctg 

tttgttaatg 

acaatgggat 

tcagggtctt 

ttcccacaaa 

ggaatccacc 

ctgataacag 

ccgcagataa 

gatacagtta 

aggggaaagt 

tgctaccaca 

gcagttggca 



agtcagttgc 
tcatccccac 
aagtaaacac 
ggacaaacat 
gactgttagg 
atctaataat 
aagaggcatt 
tcacatatag 
cattctatgc 
tgacaaaatc 
attcaggatc 
tcgggagttc 
atggccggtc 
cttttagttt 
ccatggggat 
gtggagggac 
aatgcccaag 



tcaaggtacc 
aaatgcagac 
actcactgag 
ccccaaaatt 
gaccattacc 
cgagagacga 
gcgacaaatc 
tggaataagg 
agaaatggag 
atacaaaaac 
aaccaccgaa 
caaatatcat 
cggacggatt 
caatggggct 
ccagagcgat 
tataacaagc 
atatgtaaaa 



aaaatgaaca 
aaaatttgtc 
agaggagtag 
tgctcaaaag 
ggaccacctc 
gaaggaaatg 
ctcagaggat 
accaacggaa 
tggctcctgt 
acagggagag 
cagaccaaac 
caatcttttg 
gattttcatt 
ttcatagctc 
gtgcaggttg 
agattgcctt 
caggaaagtt 
Page 



ctcaaatcct 
ttggacatca 
aagttgtcaa 
ggaaaagaac 
aatgcgacca 
atgtttgtta 
caggtgggat 
caactagtgc 
caaatacaga 
aatcagctct 
tatatgggag 
tgccgagtcc 
ggttgatctt 
caaatcgtgc 
atgctaattg 
ttcaaaacat 
tattattggc 
7 



ggttttcgcc 
tgctgtatca 
tgcaacggaa 
cactgatctt 
atttctagaa 
cccggggaag 
tgacaaagaa 
atgtagaaga 
caatgcttct 
gatagtctgg 
tggaaataaa 
aggaacacga 
ggatcccaat 
cagcttcttg 
cgaaggggaa 
aaatagcaga 
aactgggatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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aagaacgttc ccgaaccttc caaaaaaagg aaaaaaagag gcctgtttgg tgctatagca 1080 
gggtttattg aaaatggttg ggaaggtctg gtcgacgggt ggtacggttt caggcatcag 1140 
aatgcacaag gagaaggaac tgcagcagac tacaaaagca cccaatcggc aattgatcag 1200 
ataaccggaa agttaaatag actcattaag aaaaccaacc agcaatttga gctaatagat 1260 
aatgaattca ctgaagtgga aaagcagatt ggcaatttaa ttaactggac caaagactcc 1320 
atcacagaag tatggtctta caatgctgaa cttcttgtgg caatggaaaa ccagcacact 1380 
attgatttgg ctgattcaga gatgaacaag ctgtatgagc gagtgaggaa acaattaagg 1440 
gaaaatgctg aagaggatgg cactggttgc tttgaaattt ttcataaatg tgacgatgat 1500 
tgtatggcta gtataaggaa caatacttat gatcacagca aatacagaga agaagcgatg 1560 
caaaatagaa tacaaattga cccagtcaaa ttgagtagtg gctacaaaga tgtgatactt 1620 
tggtttagct tcggggcatc atgctttttg cttcttgcca ttgcaatggg ccttgttttc 1680 
atatgtgtga agaacggaaa catgcggtgc actatttgta tataggtttg gaaaaaaaca 1740 
ccccttgttt ctactccccc ccaacttcgg aggtcgacca gtactccggg cgaaactttg 1800 
tttttttttt ttcccccgat gctggaggtc gaccagatgt ccgaaagtgt cccccccccc 1860 
cccccccccc ggcgcggaac ggcggggcca ctctggactc tttttttttt tttttttttt 1920 
ttttttgggg atcggccgct agcttctgtt ttggcggatg agagaagatt ttcagcctga 1980 
tacagattaa atcagaacgc agaagcggtc tgataaaaca gaatttgcct ggcggcagta 2040 
gcgcggtggt cccacctgac cccatgccga actcagaagt gaaacgccgt agcgccgatg 2100 
gtagtgtggg gtctccccat gcgagagtag ggaactgcca ggcatcaaat aaaacgaaag 2160 
gctcagtcga aagactgggc ctttcgtttt atctgttgtt tgtcggtgaa cgctctcctg 2220 
agtaggacaa atccgccggg agcggatttg aacgttgcga agcaacggcc cggagggtgg 2280 
cgggcaggac gcccgccata aactgccagg catcaaatta agcagaaggc catcctgacg 2340 
gatggccttt ttgcgtttct acaaactctt ttgtttattt ttctaaatac attcaaatat 2400 
gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag 2460 
tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc 2520 
tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc 2580 
acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc 2640 
cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc 2700 
ccgtgttgac gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt 2760 
ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt 2820 
atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat 2880 
cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct 2940 
tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat 3000 
gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc 3060 
ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg 3120 
ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc 3180 
tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta 3240 
cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc 3300 
ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga 3360 
tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat 3420 
gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat 3480 
caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa 3540 
accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa 3600 
ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt 3660 
aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt 3720 
accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata 3780 
gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt 3840 
ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac 3900 
gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga 3960 
gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg 4020 
ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatgqaa 4080 
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat 4140 
gttctttcct gcgttatccc ctgattcatt aatgcaggtc acgatccttt ctggcgagtc 4200 
cccgtgcgga gtcggagagc gctccctgag cgcgtgcggc ccgagaggtc gcgcctggcc 4260 
ggccttcggt ccctcgtgtg tcccggtcgt aggaggggcc ggccgaaaat gcttccggct 4320 
cccgctctgg agacacgggc cggccccctg cgtgtggcac gggcggccgg gagggcgtcc 4380 
ccggcccggc gctgctcccg cgtgtgtcct ggggttgacc agagggcccc gggcgctccg 4440 
tgtgtggctg cgatggtggc gtttttgggg acaggtgtcc gtgtccgtgt cgcgcgtcgc 4500 
ctgggccggc ggcgtggtcg gtgacgcgac ctcccggccc cgggggaggt atatctttcg 4560 
ctccgagtcg gcattttggg ccgccgggtt attagtagaa acaggggtac 4610 



<210> 7 
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<211> 3558 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: pHL2583 
<400> 7 

tattagtaga aacagggtat tttttattct agtacattac gccccgccct gccactcatc 60 
gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac agacggcatg 120 
atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat atttgcccat 180 
ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa aactggtgaa 240 
actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt tagggaaata 300 
ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa actgccggaa 360 
atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat ggaaaacggt 420 
gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg ccatacggaa 480 
ttccggatga gcattcatca ggcgggcaag aatgtgaata aaggccggat aaaacttgtg 540 
cttatttttc tttacggtct ttaaaaaggc cgtaatatcc agctgaacgg tctggttata 600 
ggtacattga gcaactgact gaaatgcctc aaaatgttct ttacgatgcc attgggatat 660 
atcaacggtg gtatatccag tgattttttt ctccatgatt atggccatta cccttgtttc 720 
tactcccccc caacttcgga ggtcgaccag tactccgggc gaaactttgt tttttttttt 780 
tcccccgatg ctggaggtcg accagatgtc cgaaagtgtc cccccccccc cccccccccg 840 
gcgcggaacg gcggggccac tctggactct tttttttttt tttttttttt tttttgggga 900 
tcggccgcta gcttctgttt tggcggatga gagaagattt tcagcctgat acagattaaa 960 
tcagaacgca gaagcggtct gataaaacag aatttgcctg gcggcagtag cgcggtggtc 1020 
ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta gcgccgatgg tagtgtgggg 1080 
tctccccatg cgagagtagg gaactgccag gcatcaaata aaacgaaagg ctcagtcgaa 1140 
agactgggcc tttcgtttta tctgttgttt gtcggtgaac gctctcctga gtaggacaaa 1200 
tccgccggga gcggatttga acgttgcgaa gcaacggccc ggagggtggc gggcaggacg 1260 
cccgccataa actgccaggc atcaaattaa gcagaaggcc atcctgacgg atggcctttt 1320 
tgcgtttcta caaactcttt tgtttatttt tctaaataca ttcaaatatg tatccgctca 1380 
tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 1440 
aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 1500 
acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 1560 
acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 1620 
ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtgttgacg 1680 
ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 1740 
caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 1800 
ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 1860 
aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 1920 
aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 1980 
tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 2040 
aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 2100 
cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 2160 
ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 2220 
gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 2280 
agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 2340 
atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 2400 
cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 2460 
cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 2520 
cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 2580 
tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 2640 
tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 2700 
ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 2760 
aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 2820 
cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag 2880 
ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 2940 
agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 3000 
ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 3060 
acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 3120 
cgttatcccc tgattcatta atgcaggtca cgatcctttc tggcgagtcc ccgtgcggag 3180 
tcggagagcg ctccctgagc gcgtgcggcc cgagaggtcg cgcctggccg gccttcggtc 3240 
cctcgtgtgt cccggtcgta ggaggggccg gccgaaaatg cttccggctc ccgctctgga 3300 
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gacacgggcc ggccccctgc gtgtggcacg ggcggccggg agggcgtccc cggcccggcg 3360 
ctgctcccgc gtgtgtcctg gggttgacca gagggccccg ggcgctccgt gtgtggctgc 3420 
gatggtggcg tttttgggga caggtgtccg tgtccgtgtc gcgcgtcgcc tgggccggcg 3480 
gcgtggtcgg tgacgcgacc tcccggcccc gggggaggta tatctttcgc tccqaqtcqq 3540 
cattttgggc cgccgggt y y yy 



<210> 8 
<211> 4343 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: pHL2989 

<400> 8 
ctttctggcg 
ggtcgcgcct 
aaatgcttcc 
ccgggagggc 
ccccgggcgc 
gtgtcgcgcg 
aggtatatct 
tattttttat 
ctggagttgt 
tcagtggaga 
ctactggaaa 
aatgcttttc 
ccgaaggtta 
gtgctgaagt 
attttaaaga 
atgtatacat 
acaacatcga 
gcgatggccc 
aagatcccaa 
ttacacatgg 
catggtctag 



attcattaag 
ccagcggcat 
cgaagaagtt 
tggcactcac 
cgtaacacgc 
cactccagag 
cactatccca 
tcatcaggcg 
cggtctttaa 
ctgactgaaa 
atccagtgat 
acttcggagg 
ggaggtcgac 
ggggccactc 
ttctgttttg 
agcggtctga 
atgccgaact 
agagtaggga 
tcgttttatc 
ggatttgaac 
tgccaggcat 
aactcttttg 
cctgataaat 
tcgcccttat 
tggtgaaagt 



agtccccgtg 
ggccggcctt 
ggctcccgct 
gtccccggcc 
tccgtgtgtg 
tcgcctgggc 
ttcgctccga 
actagtaagc 
cccaattctt 
gggtgaaggt 
actacctgtt 
aagataccca 
tgtacaggaa 
caagtttgaa 
^Qatggaaac 
catggctgac 
ggacggcggc 
tgtcctttta 
cgaaaagaga 
catggatgaa 
atatctagta 

cattctgccg 
cagcaccttg 
gtccatattg 
aaagaacatg 
cacatcttgc 
cgatgaaaac 
tatcaccagc 
ggcaagaatg 
aaaggccgta 
tgcctcaaaa 
ttttttctcc 
tcgaccagta 
cagatgtccg 
tggactcttt 
gcggatgaga 
taaaacagaa 
cagaagtgaa 
actgccaggc 
tgttgtttgt 
gttgcgaagc 
caaattaagc 
tttatttttc 
gcttcaataa 
tccctttttt 
aaaagatgct 



cggagtcgga 
cggtccctcg 
ctggagacac 
cggcgctgct 
gctgcgatgg 
cggcggcgtg 
gtcggcattt 
tcgaaggagt 
gttgaattag 
gatgcaacat 
ccatggccaa 
gatcatatga 
agaactatat 
ggtgataccc 
attcttggac 
aagcagaaga 
gtgcagctgg 
ccagacaacc 
gaccacatgg 
ctatacaagg 
cattacgccc 

acatggaagc 
tcgccttgcg 
gccacgttta 
ttctcgatga 
gaatatatgt 
gtttcagttt 
tcaccgtctt 
tgaataaagg 
atatccagct 
tgttctttac 
atgattatgc 
ctccgggcga 
aaagtgtccc 
tttttttttt 
gaagattttc 
tttgcctggc 
acgccgtagc 
atcaaataaa 
cggtgaacgc 
aacggcccgg 
agaaggccat 
taaatacatt 
tattgaaaaa 
gcggcatttt 
gaagatcagt 



gagcgctccc 
tgtgtcccgg 
gggccggccc 
cccgcgtgtg 
tggcgttttt 
gtcggtgacg 
tgggccgccg 
ccaccatgag 
atggtgatgt 
acggaaaact 
cacttgtcac 
aacagcatga 
ttttcaaaga 
ttgttaatag 
acaaattgga 
acggaatcaa 
ccgaccacta 
attacctgtc 
tccttcttga 
gatcccatca 
cgccctgcca 

catcacagac 
tataatattt 
aatcaaaact 
atcctttagg 
gtagaaactg 
gctcatggaa 
tcattgccat 
ccggataaaa 
gaacggtctg 
gatgccattg 
aaaaaatacc 
aactttgttt 
cccccccccc 
tttttttttt 
agcctgatac 
ggcagtagcg 
gccgatggta 
acgaaaggct 
tctcctgagt 
agggtggcgg 
cctgacggat 
caaatatgta 
ggaagagtat 
gccttcctgt 
tgggtgcacg 
Page 



tgagcgcgtg 
tcgtaggagg 
cctgcgtgtg 
tcctggggtt 
ggggacaggt 
cgacctcccg 
ggttattagt 
taaaggagaa 
taatgggcac 
tacccttaaa 
tactttcact 
ctttttcaag 
tgacgggaac 
aatcgagtta 
atacaactat 
ggccaacttc 
ccagcagaac 
cacacaatct 
gtttgtaaca 
ccatcaccat 
ctcatcgcag 

ggcatgatga 
gcccatggtg 
ggtgaaactc 
gaagtaggcc 
ccggaaatcg 
aacggtgtaa 
acggaattcc 
cttgtgctta 
gttataggta 
ggatatatca 
cttgtttcta 
tttttttttc 



cccccccggc 
tttggggatc 
agattaaatc 
cggtggtccc 
gtgtggggtc 
cagtcgaaag 
aggacaaatc 
gcaggacgcc 
ggcctttttg 
tccgctcatg 
gagtattcaa 
ttttgctcac 
agtgggttac 
10 



cggcccgaga 
ggccggccga 
gcacgggcgg 
gaccagaggg 
gtccgtgtcc 
gccccggggg 
agaaacaggg 
gaacttttca 
aaattttctg 
tttatttgca 
tatggtgttc 
agtgccatgc 
tacaagacac 
aaaggtattg 
aactcacaca 
aagacccgcc 
accccaattg 
gccctttcga 
gctgctggga 
cactaagctc 
tactgttgta 

acctgaatcg 
aaaacggggg 
acccagggat 
aggttttcac 
tcgtggtatt 
caagggtgaa 
ggatgagcat 
tttttcttta 
cattgagcaa 
acggtggtat 
ctccccccca 
ccccgatgct 
gcggaacggc 
ggccgctagc 
agaacgcaga 
acctgacccc 
tccccatgcg 
^ctgggcctt 
cgccgggagc 
cgccataaac 
cgtttctaca 
agacaataac 
catttccgtg 
ccagaaacgc 
atcgaactgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



atctcaacag cggtaagatc cttgagagtt 
gcacttttaa agttctgcta tgtggcgcgg 
aactcggtcg ccgcatacac tattctcaga 
aaaagcatct tacggatggc atgacagtaa 
gtgataacac tgcggccaac ttacttctga 
cttttttgca caacatgggg gatcatgtaa 
atgaagccat accaaacgac gagcgtgaca 
tgcgcaaact attaactggc gaactactta 
ggatggaggc ggataaagtt gcaggaccac 
ttattgctga taaatctgga gccggtgagc 
ggccagatgg taagccctcc cgtatcgtag 
tggatgaacg aaatagacag atcgctgaga 
tgtcagacca agtttactca tatatacttt 
aaaggatcta ggtgaagatc ctttttgata 
tttcgttcca ctgagcgtca gaccccgtag 
tttttctgcg cgtaatctgc tgcttgcaaa 
gtttgccgga tcaagagcta ccaactcttt 
agataccaaa tactgtcctt ctagtgtagc 
tagcaccgcc tacatacctc gctctgctaa 
ataagtcgtg tcttaccggg ttggactcaa 
cgggctgaac ggggggttcg tgcacacagc 
tgagatacct acagcgtgag ctatgagaaa 
acaggtatcc ggtaagcggc agggtcggaa 
gaaacgcctg gtatctttat agtcctgtcg 
ttttgtgatg ctcgtcaggg gggcggagcc 
tacggttcct ggccttttgc tggccttttg 
attcattaat gcaggtcacg ate 
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ttcgccccga agaacgtttt ccaatgatga 2820 
tattatcccg tgttgacgcc gggcaagagc 2880 
atgacttggt tgagtactca ccagtcacag 2940 
gagaattatg cagtgctgcc ataaccatga 3000 
caacgatcgg aggaccgaag gagctaaccg 3060 
ctcgccttga tcgttgggaa ccggagctga 3120 
ccacgatgcc tgtagcaatg gcaacaacgt 3180 
ctctagcttc ccggcaacaa ttaatagact 3240 
ttctgcgctc ggcccttccg gctggctggt 3300 
gtgggtctcg cggtatcatt gcagcactgg 3360 
ttatctacac gacggggagt caggcaacta 3420 
taggtgcctc actgattaag cattggtaac 3480 
agattgattt aaaacttcat ttttaattta 3540 
atctcatgac caaaatccct taacgtgagt 3600 
aaaagatcaa aggatcttct tgagatcctt 3660 
caaaaaaacc accgctacca gcggtggttt 3720 
ttccgaaggt aactggcttc agcagagcgc 3780 
cgtagttagg ccaccacttc aagaactctg 3840 
tcctgttacc agtggctgct gccagtggcg 3900 
gacgatagtt accggataag gcgcagcggt 3960 
ccagcttgga gcgaacgacc tacaccgaac 4020 
gcgccacgct tcccgaaggg agaaaggcgg 4080 
caggagagcg cacgagggag cttccagggg 4140 
ggtttcgcca cctctgactt gagcgtcgat 4200 
tatggaaaaa cgccagcaac gcggcctttt 4260 
ctcacatgtt ctttcctgcg ttatcccctg 4320 

4343 



<210> 9 
<211> 3888 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pHLl920 
<400> 9 

cccaaaaaaa aaaaaaaaaa aaaaaaaaag agtccagagt ggccccgccg ttccgcgccg 60 
gggggggggg ggggggggga cactttcgga catctggtcg acctccagca tcgggggaaa 120 
aaaaaaaaac aaagtttcgc ccggagtact ggtcgacctc cgaagttggg ggggagtaga 180 
aacagggtag ataatcactc actgagtgac atccacatcg cgagcgcgcg taatacgact 240 
cactataggg cgaattgggt accgggcccc ccctcgaggt cgacggtatc gataagcttc 300 
gacgagattt tcaggagcta aggaagctaa aatggagaaa aaaatcactg gatataccac 360 
cgttgatata tcccaatggc atcgtaaaga acattttgag gcatttcagt cagttgctca 420 
atgtacctat aaccagaccg ttcagctgga tattacggcc tttttaaaga ccgtaaagaa 480 
aaataagcac aagttttatc cggcctttat tcacattctt gcccgcctga tgaatgctca 540 
tccggaattc cgtatggcaa tgaaagacgg tgagctggtg atatgggata gtgttcaccc 600 
ttgttacacc gttttccatg agcaaactga aacgttttca tcgctctgga gtgaatacca 660 
cgacgatttc cggcagtttc tacacatata ttcgcaagat gtggcgtgtt acggtgaaaa 720 
cctggcctat ttccctaaag ggtttattga gaatatgttt ttcgtctcag ccaatccctg 780 
ggtgagtttc accagttttg atttaaacgt ggccaatatg gacaacttct tcgcccccgt 840 
tttcaccatg ggcaaatatt atacgcaagg cgacaaggtg ctgatgccgc tggcgattca 900 
ggttcatcat gccgtttgtg atggcttcca tgtcggcaga atgcttaatg aattacaaca 960 
gtactgcgat gagtggcagg gcggggcgta atttttttaa ggcagttatt ggtgccctta 1020 
aacgcctggt gctacgcctg aataagtgat aataagcgga tgaatggcag aaattcgtcg 1080 
aagcttgata tcgaattcct gcagcccggg ggatccacta gttctagagc ggccgccacc 1140 
gcggtggagc tccagctttt gttcccttta gtgagggtta attgcgcgca ggcctagcta 1200 
ggtaaagaaa aatacccttg attcttctaa taacccggcg gcccaaaatg ccgactcgga 1260 
gcgaaagata tacctccccc ggggccggga ggtcgcgtca ccgaccacgc cgccggccca 1320 
ggcgacgcgc gacacggaca cctgtcccca aaaacgccac catcgcagcc acacacggag 1380 
cgcccggggc cctctggtca accccaggac acacgcggga gcagcgccgg gccggggacg 1440 
ccctcccggc cgcccgtgcc acacgcaggg ggccggcccg tgtctccaga gcgggagccg 1500 
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<210> 12 
<211> 12 
<212> RNA 

<213> influenza C virus 
<400> 12 

ccugcuucug cu 12 

<210> 13 
<211> 12 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified 
influenza A 3' sequence (pHLll04 and 1920) 

<400> 13 

ccuguuucua cu 12 

<210> 14 
<211> 12 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified 
influenza a 3' sequence (pHLl948) 

<400> 14 

ccucguucuc cu 12 

<210> 15 
<211> 13 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified 
influenza A 5' sequence CpHLl920) 

<400> 15 

agaagaauca agg 13 

<210> 16 
<211> 13 
<212> RNA 

<213> influenza A virus 
<400> 16 

aguagaaaca agg 13 

<210> 17 
<211> 13 
<212> RNA 

<213> Influenza B virus 
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<220> 

<221> variation 

<222> 12... 13 

<223> a or g or c or t/u at position 12 or 13 



<400> 17 

aguagwaaca rnn 13 



<210> 18 
<211> 13 
<212> RNA 

<213> influenza C virus 



<210> 19 
<211> 21 
<212> RNA 

<213> Influenza A virus 
<220> 

<221> variation 
<222> 14... 16 

<223> a or g or c or t/u at position 14 or 15 or 16 



<400> 19 

aguagaaaca aggnnnuuuu u 21 



<210> 20 
<211> 21 
<212> RNA 

<213> Artificial Sequence 
<220> 

<221> variation 
<222> 14... 16 

<223> a or g or c or t/u at position 14 or 15 or 16 
Description of Artificial Sequence: Modified 
influenza A 5 '-sequence CpHLl920) 



<210> 21 
<211> 21 

<212> RNA 

<213> Influenza B virus 

<220> 

<221> variation 
<222> 12.. ,16 

<223> a or g or c or t/u at position 12 or 13 



<400> 18 
agcaguagca agr 



13 



<400> 20 

agaagaauca aggnnnuuuu u 



21 



or 14 or 15 or 16 
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<400> 21 

aguagwaaca rnnnnnuuuu u 
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21 



<210> 22 
<211> 19 
<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description 
influenza C 



of Artificial 
5* sequence 



<400> 22 

aguaguaaca agrguuuuu 



Sequence : Modi f i ed 



19 



<210> 23 
<211> 15 
<212> RNA 
<213> Artificial 



Sequence 



<220> 
<221> 
<222> 
<223> 



variation 
1. . .3 

a or g or c or t/u at position 1 or 2 or 3 
Description of Artificial Sequence: Modified 
influenza A 3' sequence (pHLll04 and 1920) 



<400> 23 

nnnccucuuu cuacu 



15 



<210> 24 
<211> 15 
<212> RNA 

<213> Artificial Sequence 



<220> 
<221> 
<222> 
<223> 



variation 
1. . .3 

a or g or c or t/u at position 1 or 2 or 3 
Description of Artificial Sequence: Modified 
influenza A 3' sequence (pHLl948) 



<400> 24 

nnnccucguu cuccu 



15 



<210> 25 
<211> 15 
<212> RNA 

<213> Artificial Sequence 



<220> 
<221> 
<222> 
<223> 



variation 
1. . .5 

a or g or c or t/u at position 1 or 2 
or 3 or 4 or 5 

Description of Artificial sequence: Modified 
influenza B 3' sequence 



<400> 25 



Page 15 



--7048421 

nnnnnyguuu cuacu 



<210> 26 
<211> 14 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified 
influenza C 3' sequence 

<400> 26 
ccccuguuuc uacu 
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SEQUENCE LISTING 

<110> ARTEMIS PHARMACEUTICALS. GmbH 

5 <120> Stable Recombinant Influenza Viruses Free of Helper 
Viruses 

<130> 000520wo/JH/ml 

10 <140> 
<141> 

<160> 26 

15 <170> PatentIn Ver. 2.1 



PCT/EPOO/01903 



20 



25 



30 



35 



40 



<210> 1 
<211> 4930 
<212> DNA 

<213> Artificial Sequence 



45 



50 



55 



60 



<220> 
<223> Desc 

<400> 1 

cgtacgaagc 

agacgaaaaa 

ttagtgaggg 

taataacccg 

ggaggtcgcg 

ccaaaaacgc 

gacacacgcg 

gggggccggc 

gaccgggaca 

ctcagggagc 

aatgaatcag 

aaccgtaaaa 

cacaaaaatc 

gcgtttcccc 

tacctgtccg 

tatctcagtt 

cagcccgacc 

gacttatcgc 

ggtgctacag 

ggtatctgcg 

ggcaaacaaa 

agaaaaaaag 

aacgaaaact 

atccttttaa 

tctgacagtt 

tcatccatag 

tctggcccca 

gcaataaacc 

tccatccagt 

ttgcgcaacg 

gcttcattca 

aaaaaagcgg 

ttatcactca 

tgcttttctg 

ccgagttgct 



ription of Artificial Sequence: pHL2969 



ttctagaggg 

catatgctag 

ttaattgcgc 

gcggcccaaa 

tcaccgacca 

caccatcgca 

ggagcagcgc 

ccgtgtctcc 

cacgagggac 

gctctccgac 

gggataacgc 

aggccgcgtt 

gacgctcaag 

ctggaagctc 

cctttctccc 

cggtgtaggt 

gctgcgcctt 

cactggcagc 

agttcttgaa 

ctctgctgaa 

ccaccgctgg 

gatctcaaga 

cacgttaagg 

attaaaaatg 

accaatgctt 

ttgcctgact 

gtgctgcaat 

agccagccgg 

ctattaattg 

ttgttgccat 

gctccggttc 

ttagctcctt 

tggttatggc 

tgactggtga 

cttgcccggc 



attggctgag 

agcggccgcc 

gcaggcctag 

atgccgactc 

cgccgccggc 

gccacacacg 

cgggccgggg 

agagcgggag 

cgaaggccgg 

tccgcacggg 

aggaaagaac 

gctggcgttt 

tcagaggtgg 

cctcgtgcgc 

ttcgggaagc 

cgttcgctcc 

atccggtaac 

agccactggt 

gtggtggcct 

gccagttacc 

tagcggtggt 

agatcctttg 

gattttggtc 

aagttttaaa 

aatcagtgag 

ccccgtcgtg 

gataccgcga 

aagggccgag 

ttgccgggaa 

tgctacaggc 

ccaacgatca 

cggtcctccg 

agcactgcat 

gtactcaacc 

gtcaacacgg 



acgaaaaaca 

accgcggtgg 

ctaggtaaag 

ggagcgaaag 

ccaggcgacg 

gagcgcccgg 

acgccctccc 

ccggaagcat 

ccaggcgcga 

gactcgccag 

atgtgagcaa 

ttccataggc 

cgaaacccga 

tctcctgttc 

gtggcgcttt 

aagctgggct 

tatcgtcttg 

aacaggatta 

aactacggct 

ttcggaaaaa 

ttttttgttt 

atcttttcta 

atgagattat 

tcaatctaaa 

gcacctatct 

tagataacta 

gacccacgct 

cgcagaagtg 

gctagagtaa 

atcgtggtgt 

aggcgagtta 

atcgttgtca 

aattctctta 

aagtcattct 

gataataccg 



tatgctagag 

agctccagct 

aaaaataccc 

atatacctcc 

cgcgacacgg 

ggccctctgg 

ggccgcccgt 

tttcggccgg 

cctctcgggc 

aaaggatcgt 

aaggccagca 

tccgcccccc 

caggactata 

cgaccctgcc 

ctcatagctc 

gtgtgcacga 

agtccaaccc 

gcagagcgag 

acactagaag 

gagttggtag 

gcaagcagca 

C999Cftctga 

caaaaaggat 

gtatatatga 

cagcgatctg 

cgatacggga 

caccggctcc 

gtcctgcaac 

gtagttcgcc 

cacgctcgtc 

catgatcccc 

gaagtaagtt 

ctgtcatgcc 

gagaatagtg 

cgccacatag 



ggattggctg 

tttgttccct 

ttgtttctac 

cccggggccg 

acacctgtcc 

tcaaccccag 

gccacacgca 

cccctcctac 

cgcacgcgcg 

gacctgcatt 

aaaggccagg 

tgacgagcat 

aagataccag 

gcttaccgga 

acgctgtagg 

accccccgtt 

ggtaagacac 

gtatgtaggc 

gacagtattt 

ctcttgatcc 

gattacgcgc 

cgctcagtgg 

cttcacctag 

gtaaacttgg 

tctatttcgt 

gggcttacca 

agatttatca 

tttatccgcc 

agttaatagt 

gtttggtatg 

catgttgtgc 

ggccgcagtg 

atccgtaaga 

tatgcggcga 

cagaacttta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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aaagtgctca tcattggaaa acgttcttcg 
ttgagatcca gttcgatgta acccactcgt 
ttcaccagcg tttctgggtg agcaaaaaca 
agggcgacac ggaaatgttg aatactcata 
tatcagggtt attgtctcat gagcggatac 
aagagtttgt agaaacgcaa aaaggccatc 
cctggcagtt tatggcgggc gtcctgcccg 
caaatccgct cccggcggat ttgtcctact 
aaaacgaaag gcccagtctt tcgactgagc 
ctactctcgc atggggagac cccacactac 
tcggcatggg gtcaggtggg accaccgcgc 
gaccgcttct gcgttctgat ttaatctgta 
aacagaagct agcggccgat ccccaaaaaa 
tggccccgcc gttccgcgcc gggggggggg 
gacctccagc atcgggggaa aaaaaaaaaa 
ccgaagttgg gggggagtag aaacagggta 
ctgactgaaa tgccttgagc aactgactga 
agataatcac tcactgagtg acatccacat 
attgtaccag ggattggctg agacgaaaaa 
atcctggttt tcgcccttgc ggcagtcatc 
catcatgctg tatcaaatgg caccaaagta 
gtcaatgcaa cggaaacagt ggagcggaca 
agaaccactg atcttggcca atgcggactg 
gaccaatttc tagaattttc agctgatcta 
tgttacccgg ggaagtttgt taatgaagag 
gggattgaca aagaaacaat gggattcaca 
agtgcatgta gaagatcagg gtcttcattc 
acagacaatg cttctttccc acaaatgaca 
gctctgatag tctggggaat ccaccattca 
gggagtggaa ataaactgat aacagtcggg 
agtccaggaa cacgaccgca gataaatggc 
atcttggatc ccaatgatac agttactttt 
cgtgccagct tcttgagggg aaagtccatg 
aattgcgaag gggaatgcta ccacagtgga 
aacataaata gcagagcagt tggcaaatgc 
ttggcaactg ggatgaagaa cgttcccgaa 
tttggtgcta tagcagggtt tattgaaaat 
ggtttcaggc atcagaatgc acaaggagaa 
tcggcaattg atcagataac cggaaagtta 
tttgagctaa tagataatga attcactgaa 
tggaccaaag actccatcac agaagtatgg 
gaaaaccagc acactattga tttggctgat 
aggaaacaat taagggaaaa tgctgaagag 
aaatgtgacg atgattgtat ggctagtata 
agagaagaag cgatgcaaaa tagaatacaa 
aaagatgtga tactttggtt tagcttcggg 
atgggccttg ttttcatatg tgtgaagaac 
agcttgcatg 
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gggcgaaaac tctcaaggat cttaccgctg 2160 
gcacccaact gatcttcagc atcttttact 2220 
ggaaggcaaa atgccgcaaa aaagggaata 2280 
ctcttccttt ttcaatatta ttgaagcatt 2340 
atatttgaat gtatttagaa aaataaacaa 2400 
cgtcaggatg gccttctgct taatttgatg 2460 
ccaccctccg ggccgttgct tcgcaacgtt 2520 
caggagagcg ttcaccgaca aacaacagat 2580 
ctttcgtttt atttgatgcc tggcagttcc 2640 
catcggcgct acggcgtttc acttctgagt 2700 
tactgccgcc aggcaaattc tgttttatca 2760 
tcaggctgaa aatcttctct catccgccaa 2820 
aaaaaaaaaa aaaaaaaaaa gagtccagag 2880 
gggggggggg acactttcgg acatctggtc 2940 
caaagtttcg cccggagtac tggtcgacct 3000 
gataatcact cactgacgta cgttgagcaa 3060 
aatgcctgac gtctttagca aaagcagggt 3120 
cgtaccagga ttggctgaga cgaaaaacat 3180 
catattgtag gtaccaaaat gaacactcaa 3240 
cccacaaatg cagacaaaat. ttgtcttgga 3300 
aacacactca ctgagagagg agtagaagtt 3360 
aacatcccca aaatttgctc aaaagggaaa 3420 
ttagggacca ttaccggacc acctcaatgc 3480 
ataatcgaga gacgagaagg aaatgatgtt 3540 
gcattgcgac aaatcctcag aggatcaggt 3600 
tatagtggaa taaggaccaa cggaacaact 3660 
tatgcagaaa tggagtggct cctgtcaaat 3720 
aaatcataca aaaacacagg gagagaatca 3780 
ggatcaacca ccgaacagac caaactatat 3840 
agttccaaat atcatcaatc ttttgtgccg 3900 
cggtccggac ggattgattt tcattggttg 3960 
agtttcaatg gggctttcat agctccaaat 4020 
gggatccaga gcgatgtgca ggttgatgct 4080 
gggactataa caagcagatt gccttttcaa 4140 
ccaagatatg taaaacagga aagtttatta 4200 
ccttccaaaa aaaggaaaaa aagaggcctg 4260 
ggttgggaag gtctggtcga cgggtggtac 4320 
ggaactgcag cagactacaa aagcacccaa 4380 
aatagactca ttaagaaaac caaccagcaa 4440 
gtggaaaagc agattggcaa tttaattaac 4500 
tcttacaatg ctgaacttct tgtggcaatg 4560 
tcagagatga acaagctgta tgagcgagtg 4 620 
gatggcactg gttgctttga aatttttcat 4 680 
aggaacaata cttatgatca cagcaaatac 4740 
attgacccag tcaaattgag tagtggctac 4800 
gcatcatgct ttttgcttct tgccattgca 4860 
ggaaacatgc ggtgcactat atgcatttaa 4920 

4930 



<210> 2 
<211> 5811 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pAM403 
<400> 2 

aattcctttg cctaatttaa atgaggactt aacctgtgga aatattttga tgtgggaagc 60 
tgttactgtt aaaactgagg ttattggggt aactgctatg ttaaacttgc attcagggac 120 
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acaaaaaact catgaaaatg gtgctggaaa 
tgctgttggt ggggaacctt tggagctgca 
tcctgctcaa actgtaaccc caaaaaatgc 
ccacaaggct gttttggata aggataatgc 
5 aagtaaaaat gaaaacacta gatattttgg 
tgttttgcac attactaaca cagcaaccac 
cttgtgcaaa gctgacagct tgtatgtttc 
cacttctgga acacagcagt ggaagggact 
gcggtctgtg aaaaacccct acccaatttc 

10 gacacagagg gtggatgggc agcctatgat 
ggtttatgag gacacagagg agcttcctgg 
gtttggacaa accacaacta gaatgcagtg 
tgctattgct ttatttgtaa ccattataag 
cattcatttt atgtttcagg ttcaggggga 

15 cctctacaaa tgtggtatgg ctgattatga 
tccttattaa cccctttaca aattaaaaag 
ttaatagcag acactctatg cctgtgtgga 
ctgttatgcc tacttataaa ggttacagaa 
gcagcttttt cctttgtggt gtaaatagca 

20 gcatgactca aaaaacttag caattctgaa 
ttcttttttg gaggagtaga atgttgagag 

^ atttcttctg agcaaaacag gttttcctca 

^- tcagttccat aggttggaat ctaaaataca 
tatacactta aaaattttat atttacctta 

25 aattatgtca caccacagaa gtaaggttcc 
cgtgcctccc cactcctgca gttcgggggc 
ctggatgccg acggatttgc actgccggta 
gcagctgaac caactcgcga ggggatcgag 
atccccgcgc tggaggatca tccagccggc 

30 ttcatagaag gcggcggtgg aatcgaaatc 
ggtcatttcg atgaattcga gctcggtacc 
tcctcacgga ctcatcagag ttgctcaatt 
gtcaagaagg cgatagaagg cgatgcgctg 
gaggaagcgg tcagcccatt cgccgccaag 

35 tatgtcctga tagcggtccg ccacacccag 
gccattttcc accatgatat tcggcaagca 
gccgtcgggc atgcgcgcct tgagcctggc 
ctcttcgtcc agatcatcct gatcgacaag 
gatgcgatgt ttcgcttggt ggtcgaatgg 

40 ccgcattgca tcagccatga tggatacttt 
atcctgcccc ggcacttcgc ccaatagcag 

.V:: gagcacagct gcgcaaggaa cgcccgtcgt 

■'^■^ ctgcagttca ttcagggcac cggacaggtc 
cgctgacagc cggaacacgg cggcatcaga 

45 gccgaatagc ctctccaccc aagcggccgg 
catgcgaaac gatcctcatc ctgtctcttg 
ccttggcggc aagaaagcca tccagtttac 
cgccccagct ggcaattccg gttcgcttgc 
ccatgtaagc ccactgcaag ctacctgctt 

50 gatagcccag tagctgacat tcatccgggg 
cgtgttccgc ttcctttagc agcccttgcg 
ttgcaaaagc ctaggcctcc aaaaaagcct 
gaggcggcct cggcctctgc ataaataaaa 
gcggaactgg gcggagttag gggcgggatg 

55 gactaattga gatgcatgct ttgcatactt 
cacctggttg ctgactaatt gagatgcatg 
ggggactttc cacaccctaa ctgacacaca 
gacggtgaaa acctctgaca catgcagctc 
gatgccggga gcagacaagc ccgtcagggc 

60 gcagccatga cccagtcacg tagcgatagc 
cagagcagat tgtactgaga gtgcaccata 
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acccattcaa gggtcaaatt ttcatttttt 180 
gggtgtgtta gcaaactaca ggaccaaata 240 
tacagttgac agtcagcaga tgaacactga 300 
ttatccagtg gagtgctggg ttcctgatcc 360 
aacctacaca ggtggggaaa atgtgcctcc 420 
agtgcttctt gatgagcagg gtgttgggcc 480 
tgctgttgac atttgtgggc tgtttaccaa 540 
tcccagatat tttaaaatta cccttagaaa 600 
ctttttgtta agtgacctaa ttaacaggag 660 
tggaatgtcc tctcaagtag aggaggttag 720 
ggatccagac atgataagat acattgatga 780 
aaaaaaatgc tttatttgtg aaatttgtga 840 
ctgcaataaa caagttaaca acaacaattg 900 
ggtgtgggag gttttttaaa gcaagtaaaa 960 
tctctagtca aggcactata catcaaatat 1020 
ctaaaggtac acaatttttg agcatagtta 1080 
gtaagaaaaa acagtatgtt atgattataa 1140 
tatttttcca taattttctt gtatagcagt 1200 
aagcaagcaa gagttctatt actaaacaca 1260 
ggaaagtcct tggggtcttc tacctttctc 1320 
tcagcagtag cctcatcatc actagatggc 1380 
ttaaaggcat tccaccactg ctcccattca 1440 
caaacaatta gaatcagtag tttaacacat 1500 
gagctttaaa tctctgtagg tagtttgtcc 1560 
ttcacaaaga tccgggacca aagcggccat 1620 
atggatgcgc ggatagccgc tgctggtttc 1680 
gaactccgcg aggtcgtcca gcctcaggca 174 0 
cccggggtgg gcgaagaact ccagcatgag 1800 
gtcccggaaa acgattccga agcccaacct 18 60 
tcgtgatggc aggttgggcg tcgcttggtc 1920 
cggggatcct ctagaggcat ttcagtttcg 1980 
cgaaccccag agtcccgctc agaagaactc 2040 
cgaatcggga gcggcgatac cgtaaagcac 2100 
ctcttcagca atatcacggg tagccaacgc 2160 
ccggccacag tcgatgaatc cagaaaagcg 2220 
ggcatcgcca tgggtcacga cgagatcctc 2280 
gaacagttcg gctggcgcga gcccctgatg 2340 
accggcttcc atccgagtac gtgctcgctc 2400 
gcaggtagcc ggatcaagcg tatgcagccg 24 60 
ctcggcagga gcaaggtgag atgacaggag 2520 
ccagtccctt cccgcttcag tgacaacgtc 2580 
ggccagccac gatagccgcg ctgcctcgtc 2640 
ggtcttgaca aaaagaaccg ggcgcccctg 2700 
gcagccgatt gtctgttgtg cccagtcata 2760 
agaacctgcg tgcaatccat cttgttcaat 2820 
atcagatctt gatcccctgc gccatcagat 2880 
tttgcagggc ttcccaacct taccagaggg 2940 
tgtccataaa accgcccagt ctagctatcg 3000 
tctctttgcg cttgcgtttt cccttgtcca 3060 
tcagcaccgt ttctgcggac tggctttcta 3120 
ccctgagtgc ttgcggcagc gtgaagcttt 3180 
cctcactact tctggaatag ctcagaggcc 3240 
aaaattagtc agccatgggg cggagaatgg 3300 
ggcggagtta ggggcgggac tatggttgct 3360 
ctgcctgctg gggagcctgg ggactttcca 3420 
ctttgcatac ttctgcctgc tggggagcct 3480 
ttccacagct gcctcgcgcg tttcggtgat 3540 
ccggagacgg tcacagcttg tctgtaagcg 3600 
gcgtcagcgg gtgttggcgg gtgtcggggc 3660 
ggagtgtata ctggcttaac tatgcggcat 3720 
tgcggtgtga aataccgcac agatgcgtaa 3780 
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ggagaaaata 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
tgtccgcctt 
tcagttcggt 
ccgaccgctg 
tatcgccact 
ctacagagtt 
tctgcgctct 
aacaaaccac 
aaaaaggatc 
aaaactcacg 
ttttaaatta 
acagttacca 
ccatagttgc 
gccccagtgc 
taaaccagcc 
tccagtctat 
gcaacgttgt 
cattcagctc 
aagcggttag 
cactcatggt 
tttctgtgac 
gttgctcttg 
tgctcatcat 
gatccagttc 
ccagcgtttc 
cgacacggaa 
agggttattg 
gggttccgcg 
tgacattaac 



ccgcatcagg 

gcggcgagcg 

taacgcagga 

cgcgttgctg 

ctcaagtcag 

aagctccctc 

tctcccttcg 

gtaggtcgtt 

cgccttatcc 

ggcagcagcc 

cttgaagtgg 

gctgaagcca 

cgctggtagc 

tcaagaagat 

ttaagggatt 

aaaatgaagt 

atgcttaatc 

ctgactcccc 

tgcaatgata 

agccggaagg 

taattgttgc 

tgccattgct 

cggttcccaa 

ctccttcggt 

tatggcagca 

tggtgagtac 

cccggcgtca 

tggaaaacgt 

gatgtaaccc 

tgggtgagca 

atgttgaata 

tctcatgagc 

cacatttccc 

ctataaaaat 



cgctcttccg 

gtatcagctc 

aagaacatgt 

gcgtttttcc 

aggtggcgaa 

gtgcgctctc 

ggaagcgtgg 

cgctccaagc 

ggtaactatc 

actggtaaca 

tggcctaact 

gttaccttcg 

ggtggttttt 

cctttgatct 

ttggtcatga 

tttaaatcaa 

agtgaggcac 

gtcgtgtaga 

ccgcgagacc 

gccgagcgca 

cgggaagcta 

gcaggcatcg 

cgatcaaggc 

cctccgatcg 

ctgcataatt 

tcaaccaagt 

acacgggata 

tcttcggggc 

actcgtgcac 

aaaacaggaa 

ctcatactct 

ggatacatat 

cgaaaagtgc 

aggcgtatca 



cttcctcgct 

actcaaaggc 

gagcaaaagg 

ataggctccg 

acccgacagg 

ctgttccgac 

cgctttctca 

tgggctgtgt 

gtcttgagtc 

ggattagcag 

acggctacac 

gaaaaagagt 

ttgtttgcaa 

tttctacggg 

gattatcaaa 

tctaaagtat 

ctatctcagc 

taactacgat 

cacgctcacc 

gaagtggtcc 

gagtaagtag 

tggtgtcacg 

gagttacatg 

ttgtcagaag 

ctcttactgt 

cattctgaga 

ataccgcgcc 

gaaaactctc 

ccaactgatc 

ggcaaaatgc 

tcctttttca 

ttgaatgtat 

cacctgacgt 

cgaggccctt 



cactgactcg 

ggtaatacgg 

ccagcaaaag 

cccccctgac 

actataaaga 

cctgccgctt 

tagctcacgc 

gcacgaaccc 

caacccggta 

agcgaggtat 

tagaaggaca 

tggtagctct 

gcagcagatt 

gtctgacgct 

aaggatcttc 

atatgagtaa 

gatctgtcta 

acgggagggc 

ggctccagat 

tgcaacttta, 

ttcgccagtt 

ctcgtcgttt 

atcccccatg 

taagttggcc 

catgccatcc 

atagtgtatg 

acatagcaga 

aaggatctta 

ttcagcatct 

cgcaaaaaag 

atattattga 

ttagaaaaat 

ctaagaaacc 

tcgtcttcaa 



ctgcgctcgg 

ttatccacag 

gccaggaacc 

gagcatcaca 

taccaggcgt 

accggatacc 

tgtaggtatc 

cccgttcagc 

agacacgact 

gtaggcggtg 

gtatttggta 

tgatccggca 

acgcgcagaa 

cagtggaacg 

acctagatcc 

acttggtctg 

tttcgttcat 

ttaccatctg 

ttatcagcaa 

tccgcctcca 

aatagtttgc 

ggtatggctt 

ttgtgcaaaa 

gcagtgttat 

gtaagatgct 

cggcgaccga 

actttaaaag 

ccgctgttga 

tttactttca 

ggaataaggg 

agcatttatc 

aaacaaatag 

attattatca 

g 



3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5811 
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50 
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<210> 3 
<211> 2005 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vHM41 



<400> 3 

agtagaaaca 

aagggaacaa 

tcagccaatc 

gcatgcaagc 

acaaggccca 

atcacatctt 

gcttcttctc 

tcgtcacatt 

aattgtttcc 

tgctggtttt 

tctttggtcc 

attagctcaa 

tcaattgccg 

tgcctgaaac 

atagcaccaa 

ccagttgcca 



agggtatttt 
aagctggagc 
cctctagcat 
tttaaatgca 
ttgcaatggc 
tgtagccact 
tgtatttgct 
tatgaaaaat 
tcactcgctc 
ccattgccac 
agttaattaa 
attgctggtt 
attgggtgct 
cgtaccaccc 
acaggcctct 
ataataaact 



tctttaccta 
tccaccgcgg 
atgtttttcg 
tatagtgcac 
aagaagcaaa 
actcaatttg 
gtgatcataa 
ttcaaagcaa 
atacagcttg 
aagaagttca 
attgccaatc 
ggttttctta 
tttgtagtct 
gtcgaccaga 
ttttttcctt 
ttcctgtttt 



gctaggcctg 
tggcggccgc 
tctcagccaa 
cgcatgtttc 
aagcatgatg 
actgggtcaa 
gtattgttcc 
ccagtgccat 
ttcatctctg 
gcattgtaag 
tgcttttcca 
atgagtctat 
gctgcagttc 
ccttcccaac 
tttttggaag 
acatatcttg 



cgcgcaatta 
tctagcatat 
tccctctaga 
cgttcttcac 
ccccgaagct 
tttgtattct 
ttatactagc 
cctcttcagc 
aatcagccaa 
accatacttc 
cttcagtgaa 
ttaactttcc 
cttctccttg 
cattttcaat 
gttcgggaac 
ggcatttgcc 



accctcacta 
gtttttcgtc 
agcttcgtac 
acatatgaaa 
aaaccaaagt 
attttgcatc 
catacaatca 
attttccctt 
atcaatagtg 
tgtgatggag 
ttcattatct 
ggttatctga 
tgcattctga 
aaaccctgct 
gttcttcatc 
aactgctctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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720 

780 

840 

900 

960 
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ctatttatgt 
ccttcgcaat 
aagctggcac 
ggatccaaga 
gttcctggac 
tttccactcc 
actatcagag 
gcattgtctg 
ctacatgcac 
ttgtcaatcc 
cccgggtaac 
agaaattggt 
tcagtggttc 
gttgcattga 
acagcatgat 
aaaaccagga 
cctggtacaa 
agtgattatc 



tttgaaaagg 
tagcatcaac 
gatttggagc 
tcaaccaatg 
tcggcacaaa 
catatagttt 
ctgattctct 
tatttgacag 
tagttgttcc 
cacctgatcc 
aaacatcatt 
cgcattgagg 
ttttcccttt 
caacttctac 
gtccaagaca 
tttgagtgtt 
tatgtttttc 
taccctgctt 



caatctgctt 
ctgcacatcg 
tatgaaagcc 
aaaatcaatc 
agattgatga 
ggtctgttcg 
ccctgtgttt 
gagccactcc 
gttggtcctt 
tctgaggatt 
tccttctcgt 
tggtccggta 
tgagcaaatt 
tcctctctca 
aattttgtct 
cattttggta 
gtctcagcca 
ttgct 



gttatagtcc 
ctctggatcc 
ccattgaaac 
cgtccggacc 
tatttggaac 
gtggttgatc 
ttgtatgatt 
atttctgcat 
attccactat 
tgtcgcaatg 
ctctcgatta 
atggtcccta 
ttggggatgt 
gtgagtgtgt 
gcatttgtgg 
cctacaatat 
atcctggtac 



c tccactgtg 
ccatggactt 
taaaagtaac 
ggccatttat 
tcccgactgt 
ctgaatggtg 
ttgtcatttg 
agaatgaaga 
atgtgaatcc 
cctcttcatt 
ttagatcagc 
acagtccgca 
ttgtccgctc 
ttactttggt 
ggatgactgc 
gtttttcgtc 
gatgtggatg 



gtagcattcc 
tcccctcaag 
tgtatcattg 
ctgcggtcgt 
tatcagttta 
gattccccag 
tgggaaagaa 
ccctgatctt 
cattgtttct 
aacaaacttc 
tgaaaattct 
ttggccaaga 
cactgtttcc 
gccatttgat 
cgcaagggcg 
tcagccaatc 
tcactcagtg 



<210> 4 
<211> 1146 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vHM81 



<400> 4 

agtagaaaca 

aagggaacaa 

tcagccaatc 

gcatgcttaa 

ttgtaaggct 

tctcacttct 

ttttccgttt 

cgaatctctg 

tgaggacccc 

gtgaaatttc 

ccagccgatc 

cctggtccat 

agtcccttga 

caatggtcat 

ttccaacacg 

acttctgatc 

atcgtttgcg 

ccattatggt 

cgtctcagcc 

tttgct 



agggtatttt 
aagctggagc 
cctctagcat 
ataagctgaa 
tgcataaatg 
tcaattaacc 
ctgctttgga 
tatagtttca 
aattgcattt 
gccgacaatt 
gaaaatcaca 
tctgatgcaa 
catctcttca 
tttaagtgcc 
agtagctgtg 
tcggcgaagt 
gacatgccaa 
acctacaata 
aatcctggta 



tctttaccta 
tccaccgcgg 
atgtttttcg 
acgagaaagt 
ttatttgttc 
atcttatttc 
gggagtggag 
gagactcgaa 
ttgacatcct 
gctccctcat 
ctgaagtttg 
agggagcctg 
agagtcatgt 
tcatcggatt 
tcgatgtcca 
cggtcaagga 
agaaagcagt 
tgtttttcgt 
cgatgtggat 



gctaggcctg 
tggcggccgc 
tctcagccaa 
tcttatctct 
aaaactattc 
ttcaaatttc 
gtcccccatt 
ctgtgttatc 
catcagtatg 
cggttaaagc 
ctttcagtat 
ccactttctg 
cagttaggta 
cgtcctccag 
gaccaagagt 
atggggcatc 
ctacctgaaa 
ctcagccaat 
gtcactcagt 



cgcgcaatta 
tctagcatat 
tccctctaga 
tgctccactt 
tctgttatct 
tgactcaatt 
ctcattactg 
attccattca 
tcctggaaga 
ccttaatagt 
gatgttcttc 
tttgggcatg 
gcgtgtagca 
aatccgctcc 
gctgcctctt 
acccatttct 
gcttgacaca 
ccctggtaca 
gagtgattat 



accctcacta 
gtttttcgtc 
agcttcgtac 
caagcggtag 
tcaatctatg 
gttctcgcca 
cttctccaag 
agtcctccga 
gaaggcaatg 
atgagagttt 
cccatgatcg 
agcatgaacc 
ggtacagagg 
actatctgct 
cccctcaggg 
tggtctgcaa 
gtgttggaat 
atatgttttt 
ctaccctgct 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2005 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1146 



<210> 5 

<211> 5860 

<212> DNA 

55 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: pAM424 

60 <400> 5 

catcgattgg ctgactgatg agtccgtgag gacgaaacga aaaacatatt gtagagctcg 60 
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aattcatcga 
ccgccgcctt 
tcctccagcg 
cgagttggtt 
aatccgtcgg 
aggagtgggg 
tgtggtgtga 
aaatttttaa 
ccaacctatg 
ttttgctcag 
actcctccaa 
agttttttga 
acaaaggaaa 
ataagtaggc 
agagtgtctg 
aaaggggtta 
taccacattt 
gaaacataaa 
caaataaagc 
ttgtggtttg 
ctgtgtcctc 
catccaccct 
ggtttttcac 
gctgtgttcc 
tgtcagcttt 
tagtaatgtg 
tgttttcatt 
ccaaaacagc 
ttacagtttg 
gttccccacc 
tttcatgagt 
cagttttaac 
aattaggcaa 
taatgtcatg 
cggaacccct 
ataaccctga 
ccgtgtcgcc 
aacgctggtg 
actggatctc 
gatgagcact 
agagcaactc 
cacagaaaag 
catgagtgat 
aaccgctttt 
gctgaatgaa 
aacgttgcgc 
agactggatg 
ctggtttatt 
actggggcca 
aactatggat 
gtaactgtca 
atttaaaagg 
tgagttttcg 
tccttttttt 
ggtttgtttg 
agcgcagata 
ctctgtagca 
tggcgataag 
gcggtcgggc 
cgaactgaga 
ggcggacagg 



aatgaccgac 
ctatgaaagg 
cggggatctc 
cagctgctgc 
catccaggaa 
aggcacgatg 
cataattgga 
gtgtataatg 
gaactgatga 
aagaaatgcc 
aaaagaagag 
gtcatgctgt 
aagctgcact 
ataacagtta 
ctattaataa 
ataaggaata 
gtagaggttt 
atgaatgcaa 
aatagcatca 
tccaaactca 
ataaacccta 
ctgtgtcctc 
agaccgcttt 
agaagtgttg 
gcacaagggc 
caaaacagga 
tttacttgga 
cttgtggtca 
agcaggatat 
aacagcaaaa 
tttttgtgtc 
agtaacagct 
aggaattctt 
ataataatgg 
atttgtttat 
taaatgcttc 
cttattccct 
aaagtaaaag 
aacagcggta 
tttaaagttc 
ggtcgccgca 
catcttacgg 
aacactgcgg 
ttgcacaaca 
gccataccaa 
aaactattaa 
gaggcggata 
gctgataaat 
gatggtaagc 
gaacgaaata 
gaccaagttt 
atctaggtga 
ttccactgag 
ctgcgcgtaa 
ccggatcaag 
ccaaatactg 
ccgcctacat 
tcgtgtctta 
tgaacggggg 
tacctacagc 
tatccggtaa 



caagcgacgc 
ttgggcttcg 
atgctggagt 
ctgaggctgg 
accagcagcg 
gccgctttgg 
caaactacct 
tgttaaacta 
atgggagcag 
atctagtgat 
aaaggtagaa 
gtttagtaat 
gctatacaag 
taatcataac 
ctatgctcaa 
tttgatgtat 
tacttgcttt 
ttgttgttgt 
caaatttcac 
tcaatgtatc 
acctcctcta 
ctgttaatta 
ctaagggtaa 
gtaaacagcc 
ccaacaccct 
ggcacatttt 
tcaggaaccc 
gtgttcatct 
ttggtcctgt 
aaatgaaaat 
cctgaatgca 
tcccacatca 
gaagacgaaa 
tttcttagac 
ttttctaaat 
aataatattg 
tttttgcggc 
atgctgaaga 
agatccttga 
tgctatgtgg 
tacactattc 
atggcatgac 
ccaacttact 
tgggggatca 
acgacgagcg 
ctggcgaact 
aagttgcagg 
ctggagccgg 
cctcccgtat 
gacagatcgc 
actcatatat 
agatcctttt 
cgtcagaccc 
tctgctgctt 
agctaccaac 
tccttctagt 
acctcgctct 
ccgggttgga 
gttcgtgcac 
gtgagctatg 
gcggcagggt 



ccaacctgcc 
gaatcgtttt 
tcttcgccca 
acgacctcgc 
gctatccgcg 
tcccggatct 
acagagattt 
ctgattctaa 
tggtggaatg 
gatgaggcta 
gaccccaagg 
agaactcttg 
aaaattatgg 
atactgtttt 
aaattgtgta 
agtgccttga 
aaaaaacctc 
taacttgttt 
aaataaagca 
ttatcatgtc 
cttgagagga 
ggtcacttaa 
ttttaaaata 
cacaaatgtc 
gctcatcaag 
ccccacctgt 
agcactccac 
gctgactgtc 
agtttgctaa 
ttgacccttg 
agtttaacat 
aaatatttcc 
gggcctcgtg 
gtcaggtggc 
acattcaaat 
aaaaaggaag 
attttgcctt 
tcagttgggt 
gagttttcgc 
cgcggtatta 
tcagaatgac 
agtaagagaa 
tctgacaacg 
tgtaactcgc 
tgacaccacg 
acttactcta 
accacttctg 
tgagcgtggg 
cgtagttatc 
tgagataggt 
actttagatt 
tgataatctc 
cgtagaaaag 
gcaaacaaaa 
tctttttccg 
gtagccgtag 
gctaatcctg 
ctcaagacga 
acagcccagc 
agaaagcgcc 
cggaacagga 



atcacgagat 
ccgggacgcc 
ccccgggctc 
ggagttctac 
catccatgcc 
ttgtgaagga 
aaagctctaa 
ttgtttgtgt 
cctttaatga 
ctgctgactc 
actttccttc 
cttgctttgc 



aaaaatattc 
ttcttactcc 
cctttagctt 
ctagagatca 
ccacacctcc 
attgcagctt 
tttttttcac 
tggatcccca 
cattccaatc 
caaaaaggaa 
tctgggaagt 
aacagcagaa 
aagcactgtg 
gtaggttcca 
tggataagca 
aactgtagca 
cacaccctgc 
aatgggtttt 
agcagttacc 
acaggttaag 
atacgcctat 
acttttcggg 
atgtatccgc 
agtatgagta 
cctgtttttg 
gcacgagtgg 
cccgaagaac 
tcccgtgttg 
ttggttgagt 
ttatgcagtg 
atcggaggac 
cttgatcgtt 
atgcctgcag 
gcttcccggc 
cgctcggccc 
tctcgcggta 
tacacgacgg 
gcctcactga 
gatttaaaac 
atgaccaaaa 
atcaaaggat 
aaaccaccgc 
aaggtaactg 
ttaggccacc 
ttaccagtgg 
tagttaccgg 
ttggagcgaa 
acgcttcccg 
gagcgcacga 



ttcgattcca 
ggctggatga 
gatcccctcg 
cggcagtgca 
cccgaactgc 
accttacttc 
ggtaaatata 
attttagatt 
ggaaaacctg 
tcaacattct 
agaattgcta 
tatttacacc 
tgtaaccttt 
acacaggcat 
tttaatttgt 
taatcagcca 
ccctgaacct 
ataatggtta 
tgcattctag 
ggaagctcct 
ataggctgcc 
attgggtagg 
cccttccact 
acatacaagc 
gttgctgtgt 
aaatatctag 
ttatccttat 
ttttttgggg 
agctccaaag 
ccagcaccat 
ccaataacct 
tcctcattta 
ttttataggt 
gaaatgtgcg 
tcatgagaca 
ttcaacattt 
ctcacccaga 
gttacatcga 
gttttccaat 
acgccgggca 
actcaccagt 
ctgccataac 
cgaaggagct 
gggaaccgga 
caatggcaac 
aacaattaat 



ttccggctgg 
tcattgcagc 
ggagtcaggc 
ttaagcattg 
ttcattttta 
tcccttaacg 
cttcttgaga 
taccagcggt 
gcttcagcag 
acttcaagaa 
ctgctgccag 
ataaggcgca 
cgacctacac 
aagggagaaa 
gggagcttcc 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
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15 



20 



25 



30 



35 



agggggaaac 
tcgatttttg 
ctttttacgg 
ccctgattct 
ccgaacgacc 
ttttctcctt 
ctgctctgat 
atggctgcgc 
ccggcatccg 
tcaccgtcat 
aagtccccag 
accaggtgtg 
aattagtcag 
agttccgccc 
gccgcctcgg 
ttttgcaaaa 
ggaacacgta 
gggctatctg 
ttacatggcg 
ctggggcgcc 
cgccaaggat 
ttcgcatgat 
tattcggcta 
tgtcagcgca 
aactgcagga 
ctgtgctcga 
ggcaggatct 
caatgcggcg 
atcgcatcga 
acgaagagca 
ccgacggcga 
aaaatggccg 
aggacatagc 
gcttcctcgt 
ttcttgacga 
ctgatgagtc 



gcctggtatc 

tgatgctcgt 

ttcctggcct 

gtggataacc 

gagcgcagcg 

acgcatctgt 

gccgcatagt 

cccgacaccc 

cttacagaca 

caccgaaacg 

gctccccagc 

gaaagtcccc 

caaccatagt 

attctccgcc 

cctctgagct 

agcttcacgc 

gaaagccagt 

gacaagggaa 

atagctagac 

ctctggtaag 

ctgatggcgc 

tgaacaagat 

tgactgggca 

ggggcgcccg 

cgaggcagcg 

cgttgtcact 

cctgtcatct 

gctgcatacg 

gcgagcacgt 

tcaggggctc 

ggatctcgtc 

cttttctgga 

gttggctacc 

gctttacggt 

gttcttctga 

cgtgaggacg 



tttatagtcc 
caggggggcg 
tttgctggcc 
gtattaccgc 
agtcagtgag 
gcggtatttc 
taagccagta 
gccaacaccc 
agctgtgacc 
cgcgaggcag 
aggcagaagt 
aggctcccca 
cccgccccta 
ccatggctga 
attccagaag 
tgccgcaagc 
ccgcagaaac 
aacgcaagcg 
tgggcggttt 
gttgggaagc 
^99gg^tcaa 
ggattgcacg 
caacagacaa 
gttctttttg 
cggctatcgt 
gaagcgggaa 
caccttgctc 
cttgatccgg 
actcggatgg 
gcgccagccg 
gtgacccatg 
ttcatcgact 
cgtgatattg 
atcgccgctc 
gcgggactct 
aaacgaaaaa 



tgtcgggttt 
gagcctatgg 
ttttgctcac 
ctttgagtga 
cgaggaagcg 
acaccgcata 
tacactccgc 
gctgacgcgc 
gtctccggga 
ctgtggaatg 
atgcaaagca 
gcaggcagaa 
actccgccca 
ctaatttttt 
tagtgaggag 
actcagggcg 
ggtgctgacc 
caaagagaaa 
tatggacagc 
cctgcaaagt 
gatctgatca 
caggttctcc 
tcggctgctc 
tcaagaccga 
ggctggccac 
gggactggct 
ctgccgagaa 
ctacctgccc 
aagccggtct 
aactgttcgc 
gcgatgcctg 
gtggccggct 
ctgaagagct 
ccgattcgca 
ggggttcgaa 
catatggtac 



cgccacctct 
aaaaacgcca 
atgttctttc 
gctgataccg 
gaagagcgcc 
tggtgcactc 
tatcgctacg 
cctgacgggc 
gctgcatgtg 
tgtgtcagtt 
tgcatctcaa 
gtatgcaaag 
tcccgcccct 
ttatttatgc 
gcttttttgg 
caagggctgc 
ccggatgaat 
gcaggtagct 
aagcgaaccg 
aaactggatg 
agagacagga 
ggccgcttgg 
tgatgccgcc 
cctgtccggt 
gacgggcgtt 
gctattgggc 
agtatccatc 
attcgaccac 
tgtcgatcag 
caggctcaag 
cttgccgaat 
gggtgtggcg 
tggcggcgaa 
gcgcatcgcc 
tcctaccagg 



gacttgagcg 
gcaacgcggc 
ctgcgttatc 
ctcgccgcag 
tgatgcggta 
tcagtacaat 
tgactgggtc 
ttgtctgctc 
tcagaggttt 
agggtgtgga 
ttagtcagca 
catgcatctc 
aactccgccc 
agaggccgag 
aggcctaggc 
taaaggaagc 
gtcagctact 
tgcagtgggc 
gaattgccag 
gctttcttgc 
tgaggatcgt 
gtggagaggc 
gtgttccggc 
gccctgaatg 
ccttgcgcag 
gaagtgccgg 
atggctgatg 
caagcgaaac 
gatgatctgg 
gcgcgcatgc 
atcatggtgg 
gaccgctatc 
tgggctgacc 
ttctatcgcc 
gattggctga 



3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5860 



40 



45 



50 



55 



60 



<210> 6 
<211> 4610 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pHL2507 



<400> 6 

gaggcatttc 

cttgcggcag 

aatggcacca 

acagtggagc 

ggccaatgcg 

ttttcagctg 

tttgttaatg 

acaatgggat 

tcagggtctt 

ttcccacaaa 

ggaatccacc 

ctgataacag 

ccgcagataa 

gatacagtta 



agtcagttgc 
tcatccccac 
aagtaaacac 
ggacaaacat 
gactgttagg 
atctaataat 
aagaggcatt 
tcacatatag 
cattctatgc 
tgacaaaatc 
attcaggatc 
tcgggagttc 
atggccggtc 
cttttagttt 



tcaaggtacc 
aaatgcagac 
actcactgag 
ccccaaaatt 
gaccattacc 
cgagagacga 
gcgacaaatc 
tggaataagg 
agaaatggag 
atacaaaaac 
aaccaccgaa 
caaatatcat 
cggacggatt 
caatggggct 



aaaatgaaca 
aaaatttgtc 
agaggagtag 
tgctcaaaag 
ggaccacctc 
gaaggaaatg 
ctcagaggat 
accaacggaa 
tggctcctgt 
acagggagag 
cagaccaaac 
caatcttttg 
gattttcatt 
ttcatagctc 



ctcaaatcct 
ttggacatca 
aagttgtcaa 
ggaaaagaac 
aatgcgacca 
atgtttgtta 
caggtgggat 
caactagtgc 
caaatacaga 
aatcagctct 
tatatgggag 
tgccgagtcc 
ggttgatctt 
caaatcgtgc 



ggttttcgcc 
tgctgtatca 
tgcaacggaa 
cactgatctt 
atttctagaa 
cccggggaag 
tgacaaagaa 
atgtagaaga 
caatgcttct 
gatagtctgg 
tggaaataaa 
aggaacacga 
ggatcccaat 
cagcttcttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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tgctaccaca 

gcagttggca 

aagaacgttc 

gggtttattg 

aatgcacaag 

ataaccggaa 

aatgaattca 

atcacagaag 

attgatttgg 

gaaaatgctg 

tgtatggcta 

caaaa tagaa 

tggtttagct 

atatgtgtga 

ccccttgttt 

tttttttttt 

cccccccccc 

ttttttgggg 

tacagattaa 

gegcggtggt 

gtagtgtggg 

gctcagtcga 

agtaggacaa 

cgggcaggac 

gatggccttt 

gtatccgctc 

tatgagtatt 

tgtttttgct 

acgagtgggt 

cgaagaacgt 

ccgtgttgac 

ggttgagtac 

atgcagtgct 

cggaggaccg 

tgatcgttgg 

gcctgtagca 

ttcccggcaa 

ctcggccctt 

tcgcggtatc 

cacgacgggg 

ctcactgatt 

tttaaaactt 

gaccaaaatc 

caaaggatct 

accaccgcta 

ggtaactggc 

aggccaccac 

accagtggct 

gttaccggat 

ggagcgaacg 

gcttcccgaa 

gcgcacgagg 

ccacctctga 

aaacgccagc 

gttctttcct 

cccgtgcgga 

ggccttcggt 

cccgctctgg 

ccggcccggc 

tgtgtggctg 



ccatggggat 

gtggagggac 

aatgcccaag 

ccgaaccttc 

aaaatggttg 

gagaaggaac 

agttaaatag 

ctgaagtgga 

tatggtctta 

ctgattcaga 

aagaggatgg 

gtataaggaa 

tacaaattga 

tcggggcatc 

agaacggaaa 

ctactccccc 

ttcccccgat 

ggcgcggaac 

atcggccgct 

atcagaacgc 

cccacctgac 

gtctccccat 

aagactgggc 

atccgccggg 

gcccgccata 

ttgcgtttct 

atgagacaat 

caacatttcc 

cacccagaaa 

tacatcgaac 

tttccaatga 

gccgggcaag 

tcaccagtca 

gccataacca 

aaggagctaa 

gaaccggagc 

atggcaacaa 

caattaatag 

ccggctggct 

attgcagcac 

agtcaggcaa 

aagcattggt 

catttttaat 

ccttaacgtg 

tcttgagatc 

ccagcggtgg 

ttcagcagag 

ttcaagaact 

gctgccagtg 

aaggcgcagc 

acctacaccg 

gggagaaagg 

gagcttccag 

cttgagcgtc 

aacgcggcct 

gcgttatccc 

gtcggagagc 

ccctcgtgtg 

agacacgggc 

gctgctcccg 

cgatggtggc 



ccagagcgat 
tataacaagc 
atatgtaaaa 
caaaaaaagg 
ggaaggtctg 
tgcagcagac 
actcattaag 
aaagcagatt 
caatgctgaa 
gatgaacaag 
cactggttgc 
caatacttat 
cccagtcaaa 
atgctttttg 
catgcggtgc 
ccaacttcgg 
gctggaggtc 
ggcggggcca 
agcttctgtt 
agaagcggtc 
cccatgccga 
gcgagagtag 
ctttcgtttt 
agcggatttg 
aactgccagg 
acaaactctt 
aaccctgata 
gtgtcgccct 
cgctggtgaa 
tggatctcaa 
tgagcacttt 
agcaactcgg 
cagaaaagca 
tgagtgataa 
ccgctttttt 
tgaatgaagc 
cgttgcgcaa 
actggatgga 
ggtttattgc 
tggggccaga 
ctatggatga 
aactgtcaga 
ttaaaaggat 
agttttcgtt 
ctttttttct 
tttgtttgcc 
cgcagatacc 
ctgtagcacc 
gcgataagtc 
ggtcgggctg 
aactgagata 
cggacaggta 
ggggaaacgc 
gatttttgtg 
ttttacggtt 
ctgattcatt 
gctccctgag 
tcccggtcgt 
cggccccctg 
cgtgtgtcct 
gtttttgggg 



8 

gtgcaggttg 
agattgcctt 
caggaaagtt 
aaaaaaagag 
gtcgacgggt 
tacaaaagca 
aaaaccaacc 
ggcaatttaa 
cttcttgtgg 
ctgtatgagc 
tttgaaattt 
gatcacagca 
ttgagtagtg 
cttcttgcca 
actatttgta 
aggtcgacca 
gaccagatgt 
ctctggactc 
ttggcggatg 
tgataaaaca 
actcagaagt 
ggaactgcca 
atctgttgtt 
aacgttgcga 
catcaaatta 
ttgtttattt 
aatgcttcaa 
tattcccttt 
agtaaaagat 
cagcggtaag 
taaagttctg 
tcgccgcata 
tcttacggat 
cactgcggcc 
gcacaacatg 
cataccaaac 
actattaact 
ggcggataaa 
tgataaatct 
tggtaagccc 
acgaaataga 
ccaagtttac 
ctaggtgaag 
ccactgagcg 
gcgcgtaatc 
ggatcaagag 
aaatactgtc 
gcctacatac 
gtgtcttacc 
aacggggggt 
cctacagcgt 
tccggtaagc 
ctggtatctt 
atgctcgtca 
cctggccttt 
aatgcaggtc 
cgcgtgcggc 
aggaggggcc 
cgtgtggcac 
ggggttgacc 
acaggtgtcc 
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atgctaattg 
ttcaaaacat 
tattattggc 
gcctgtttgg 
ggtacggttt 
cccaatcggc 
agcaatttga 
ttaactggac 
caatggaaaa 
gagtgaggaa 
ttcataaatg 
aatacagaga 
gctacaaaga 
ttgcaatggg 
tataggtttg 
gtactccggg 
ccgaaagtgt 
tttttttttt 
agagaagatt 
gaatttgcct- 
gaaacgccgt 
ggcatcaaat 
tgtcggtgaa 
agcaacggcc 
agcagaaggc 
ttctaaatac 
taatattgaa 
tttgcggcat 
gctgaagatc 
atccttgaga 
ctatgtggcg 
cactattctc 
ggcatgacag 
aacttacttc 
ggggatcatg 
gacgagcgtg 
ggcgaactac 
gttgcaggac 
ggagccggtg 
tcccgtatcg 
cagatcgctg 
tcatatatac 
atcctttttg 
tcagaccccg 
tgctgcttgc 
ctaccaactc 
cttctagtgt 
ctcgctctgc 
gggttggact 
tcgtgcacac 
gagctatgag 
ggcagggtcg 
tatagtcctg 
ggggggcgga 
tgctggcctt 
acgatccttt 
ccgagaggtc 
ggccgaaaat 
gggcggccgg 
agagggcccc 
gtgtccgtgt 



cgaaggggaa 900 
aaatagcaga 960 
aactgggatg 1020 
tgctatagca 1080 
caggcatcag 1140 
aattgatcag 1200 
gctaatagat 1260 
caaagactcc 1320 
ccagcacact 1380 
acaattaagg 1440 
tgacgatgat 1500 
agaagcgatg 1560 
tgtgatactt 1620 
ccttgttttc 1680 
gaaaaaaaca 1740 
cgaaactttg 1800 
cccccccccc 1860 
tttttttttt 1920 
ttcagcctga 1980 
ggcggcagta 2040 
agcgccgatg 2100 
aaaacgaaag 2160 
cgctctcctg 2220 
cggagggtgg 2280 
catcctgacg 2340 
attcaaatat 2400 
aaaggaagag 2460 
tttgccttcc 2520 
agttgggtgc 2580 
gttttcgccc 2640 
cggtattatc 2700 
agaatgactt 2760 
taagagaatt 2820 
tgacaacgat 2 8 80 
taactcgcct 2940 
acaccacgat 3000 
t tact ct age 3060 
cacttctgcg 3120 
agcgtgggtc 3180 
tagttatcta 3240 
agataggtgc 3300 
tttagattga 3360 
ataatctcat 3420 
tagaaaagat 3480 
aaacaaaaaa 3540 
tttttccgaa 3600 
agccgtagtt 3660 
taatcctgtt 3720 
caagacgata 3780 
agcccagctt 3840 
aaagcgccac 3900 
gaacaggaga 3960 
tcgggtttcg 4020 
gcctatggaa 4080 
ttgctcacat 4140 
ctggcgagtc 4200 
gcgcctggcc 4260 
gcttccggct 4320 
gagggcgtcc 4380 
gggcgctccg 4440 
cgcgcgtcgc 4500 
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ctgggccggc ggcgtggtcg gtgacgcgac ctcccggccc cgggggaggt atatctttcg 4560 
ctccgagtcg gcattttggg ccgccgggtt attagtagaa acaggggtac 4 610 
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<210> 7 
<211> 3558 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pHL2583 



<400> 7 

tattagtaga 

gcagtactgt 

atgaacctga 

ggtgaaaacg 

actcacccag 

ggccaggttt 

atcgtcgtgg 

gtaacaaggg 

ttccggatga 

cttatttttc 

ggtacattga 

atcaacggtg 

tactcccccc 

tcccccgatg 

gcgcggaacg 

tcggccgcta 

tcagaacgca 

ccacctgacc 

tctccccatg 

agactgggcc 

tccgccggga 

cccgccataa 

tgcgtttcta 

tgagacaata 

aacatttccg 

acccagaaac 

acatcgaact 

ttccaatgat 

ccgggcaaga 

caccagtcac 

ccataaccat 

aggagctaac 

aaccggagct 

tggcaacaac 

aattaataga 

cggctggctg 

ttgcagcact 

gtcaggcaac 

agcattggta 

atttttaatt 

cttaacgtga 

cttgagatcc 

cagcggtggt 

tcagcagagc 

tcaagaactc 

ctgccagtgg 

aggcgcagcg 

cctacaccga 



aacagggtat 

tgtaattcat 

atcgccagcg 

ggggcgaaqa. 

ggattggctg 

tcaccgtaac 

tattcactcc 

tgaacactat 

gcattcatca 

tttacggtct 

gcaactgact 

gtatatccag 

caacttcgga 

ctggaggtcg 

gcggggccac 

gcttctgttt 

gaagcggtct 

ccatgccgaa 

cgagagtagg 

tttcgtttta 

gcggatttga 

actgccaggc 

caaactcttt 

accctgataa 

tgtcgccctt 

gctggtgaaa 

ggatctcaac 

gagcactttt 

gcaactcggt 

agaaaagcat 

gagtgataac 

cgcttttttg 

gaatgaagcc 

gttgcgcaaa 

ctggatggag 

gtttattgct 

ggggccagat 

tatggatgaa 

actgtcagac 

taaaaggatc 

gttttcgttc 

tttttttctg 

ttgtttgccg 

gcagatacca 

tgtagcaccg 

cgataagtcg 

gtcgggctga 

actgagatac 



tttttattct 

taagcattct 

gcatcagcac 

agttgtccat 

agacgaaaaa 

acgccacatc 

agagcgatga 

cccatatcac 

ggcgggcaag 

ttaaaaaggc 

gaaatgcctc 

tgattttttt 

ggtcgaccag 

accagatgtc 

tctggactct 

tggcggatga 

gataaaacag 

ctcagaagtg 

gaactgccag 

tctgttgttt 

acgttgcgaa 

atcaaattaa 

tgtttatttt 

atgcttcaat 

attccctttt 

gtaaaagatg 

agcggtaaga 

aaagttctgc 

cgccgcatac 

cttacggatg 

actgcggcca 

cacaacatgg 

ataccaaacg 

ctattaactg 

gcggataaag 

gataaatctg 

ggtaagccct 

cgaaatagac 

caagtttact 

taggtgaaga 

cactgagcgt 

cgcgtaatct 

gatcaagagc 

aatactgtcc 

cctacatacc 

tgtcttaccg 

acggggggtt 

ctacagcgtg 



agtacattac 

gccgacatgg 

cttgtcgcct 

attggccacg 

catattctca 

ttgcgaatat 

aaacgtttca 

cagctcaccg 

aatgtgaata 

cgtaatatcc 

aaaatgttct 

ctccatgatt 

tactccgggc 

cgaaagtgtc 

tttttttttt 

gagaagattt 

aatttgcctg 

aaacgccgta 

gcatcaaata 

gtcggtgaac 

gcaacggccc 

gcagaaggcc 

tctaaataca 

aatattgaaa 

ttgcggcatt 

ctgaagatca 

tccttgagag 

tatgtggcgc 

actattctca 

gcatgacagt 

acttacttct 

gggatcatgt 

acgagcgtga 

gcgaactact 

ttgcaggacc 

gagccggtga 

cccgtatcgt 

agatcgctga 

catatatact 

tcctttttga 

cagaccccgt 

gctgcttgca 

taccaactct 

ttctagtgta 

tcgctctgct 

ggttggactc 

cgtgcacaca 

agctatgaga 



gccccgccct 

aagccatcac 

tgcgtataat 

tttaaatcaa 

ataaaccctt 

atgtgtagaa 

gtttgctcat 

tctttcattg 

aaggccggat 

agctgaacgg 

ttacgatgcc 

atggccatta 

gaaactttgt 

cccccccccc 

tttttttttt 

tcagcctgat 

gcggcagtag 

gcgccgatgg 

aaacgaaagg 

gctctcctga 

ggagggtggc 

atcctgacgg 

ttcaaatatg 

aaggaagagt 

ttgccttcct 

gttgggtgca 

ttttcgcccc 

ggtattatcc 

gaatgacttg 

aagagaatta 

gacaacgatc 

aactcgcctt 

caccacgatg 

tactctagct 

acttctgcgc 

gcgtgggtct 

agttatctac 

gataggtgcc 

ttagattgat 

taatctcatg 

agaaaagatc 

aacaaaaaaa 

ttttccgaag 

gccgtagtta 

aatcctgtta 

aagacgatag 

gcccagcttg 

aagcgccacg 



gccactcatc 

agacggcatg 

atttgcccat 

aactggtgaa 

tagggaaata 

actgccggaa 

ggaaaacggt 

ccatacggaa 

aaaacttgtg 

tctggttata 

attgggatat 

cccttgtttc 

tttttttttt 

cccccccccg 

tttttgggga 

acagattaaa 

cgcggtggtc 

tagtgtgggg 

ctcagtcgaa 

gtaggacaaa 

gggcaggacg 

atggcctttt 

tatccgctca 

atgagtattc 

gtttttgctc 

cgagtgggtt 

gaagaacgtt 

cgtgttgacg 

gttgagtact 

tgcagtgctg 

ggaggaccga 

gatcgttggg 

cctgtageaa 

tcccggcaac 

tcggcccttc 

cgcggtatca 

acgacgggga 

tcactgatta 

ttaaaacttc 

accaaaatcc 

aaaggatctt 

ccaccgctac 

gtaactggct 

ggccaccact 

ccagtggctg 

ttaccggata 

gagcgaacga 

cttcccgaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 
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ggagaaaggc 
agctt ccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
tcggagagcg 
cctcgtgtgt 
gacacgggcc 
ctgctcccgc 
gatggtggcg 
gcgtggtcgg 
cattttgggc 



ggacaggtat 
gggaaacgcc 
atttttgtga 
tttacggttc 
tgattcatta 
ctccctgagc 
cccggtcgta 
ggccccctgc 
gtgtgtcctg 
tttttgggga 
tgacgcgacc 
cgccgggt 



ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
atgcaggtca 
gcgtgcggcc 
ggaggggccg 
gtgtggcacg 
gggttgacca 
caggtgtccg 
tcccggcccc 



gcagggtcgg 
atagtcctgt 
gggggcggag 
gctggccttt 
cgatcctttc 
cgagaggtcg 
gccgaaaatg 
ggcggccggg 
gagggccccg 
tgtccgtgtc 
gggggaggta 



aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tggcgagtcc 
cgcctggccg 
cttccggctc 
agggcgtccc 
ggcgctccgt 
gcgcgtcgcc 
tatctttcgc 



cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
ccgtgcggag 
gccttcggtc 
ccgctctgga 
cggcccggcg 
gtgtggctgc 
tgggccggcg 
tccgagtcgg 
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3558 
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<210> 8 
<211> 4343 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pHL2989 



<400> 8 

ctttctggcg 

ggtcgcgcct 

aaatgcttcc 

ccgggagggc 

ccccgggcgc 

gtgtcgcgcg 

aggtatatct 

tattttttat 

ctggagttgt 

tcagtggaga 

ctactggaaa 

aatgcttttc 

ccgaaggtta 

gtgctgaagt 

attttaaaga 

atgtatacat 

acaacatcga 

gcgatggccc 

aagatcccaa 

ttacacatgg 

catggtctag 

attcattaag 

ccagcggcat 

cgaagaagtt 

tggcactcac 

cgtaacacgc 

cactccagag 

cactatccca 

tcatcaggcg 

cggtctttaa 

ctgactgaaa 

atccagtgat 

acttcggagg 

ggaggtcgac 

ggggccactc 

ttctgttttg 

agcggtctga 

atgccgaact 



agtccccgtg 
ggccggcctt 
ggctcccgct 
gtccccggcc 
tccgtgtgtg 
tcgcctgggc 
ttcgctccga 
actagtaagc 
cccaattctt 
gggtgaaggt 
actacctgtt 
aagataccca 
tgtacaggaa 
caagtttgaa 
agatggaaac 
catggctgac 
ggacggcggc 
tgtcctttta 
cgaaaagaga 
catggatgaa 
atatctagta 
cattctgccg 
cagcaccttg 
gtccatattg 
aaagaacatg 
cacatcttgc 
cgatgaaaac 
tatcaccagc 
ggcaagaatg 
aaaggccgta 
tgcctcaaaa 
ttttttctcc 
tcgaccagta 
cagatgtccg 
tggactcttt 
gcggatgaga 
taaaacagaa 
cagaagtgaa 



cggagtcgga 
cggtccctcg 
ctggagacac 
cggcgctgct 
gctgcgatgg 
cggcggcgtg 
gtcggcattt 
tcgaaggagt 
gttgaattag 
gatgcaacat 
ccatggccaa 
gatcatatga 
agaactatat 
ggtgataccc 
attcttggac 
aagcagaaga 
gtgcagctgg 
ccagacaacc 
gaccacatgg 
ctatacaagg 
cattacgccc 
acatggaagc 
tcgccttgcg 
gccacgttta 
ttctcgatga 
gaatatatgt 
gtttcagttt 
tcaccgtctt 
tgaataaagg 
atatccagct 
tgttctttac 
atgattatgc 
ctccgggcga 
aaagtgtccc 
tttttttttt 
gaagattttc 
tttgcctggc 
acgccgtagc 



gagcgctccc 
tgtgtcccgg 
gggccggccc 
cccgcgtgtg 
tggcgttttt 
gtcggtgacg 
tgggccgccg 
ccaccatgag 
atggtgatgt 
acggaaaact 
cacttgtcac 
aacagcatga 
ttttcaaaga 
ttgttaatag 
acaaattgga 
acggaatcaa 
ccgaccacta 
attacctgtc 
tccttcttga 
gatcccatca 
cgccctgcca 
catcacagac 
tataatattt 
aatcaaaact 
atcctttagg 
gtagaaactg 
gctcatggaa 
tcattgccat 
ccggataaaa 
gaacggtctg 
gatgccattg 
aaaaaatacc 
aactttgttt 
cccccccccc 
tttttttttt 
agcctgatac 
ggcagtagcg 
gccgatggta 



tgagcgcgtg 
tcgtaggagg 
cctgcgtgtg 
tcctggggtt 
ggggacaggt 
cgacctcccg 
ggttattagt 
taaaggagaa 
taatgggcac 
tacccttaaa 
tactttcact 
ctttttcaag 
tgacgggaac 
aatcgagtta 
atacaactat 
ggccaacttc 
ccagcagaac 
cacacaatct 
gtttgtaaca 
ccatcaccat 
ctcatcgcag 
ggcatgatga 
gcccatggtg 
ggtgaaactc 
gaagtaggcc 
ccggaaatcg 
aacggtgtaa 
acggaattcc 
cttgtgctta 
gttataggta 
ggatatatca 
cttgtttcta 
tttttttttc 
cccccccggc 
tttggggatc 
agattaaatc 
cggtggtccc 
gtgtggggtc 



cggcccgaga 
ggccggccga 
gcacgggcgg 
gaccagaggg 
gtccgtgtcc 
gccccggggg 
agaaacaggg 
gaacttttca 
aaattttctg 
tttatttgca 
tatggtgttc 
agtgccatgc 
tacaagacac 
aaaggtattg 
aactcacaca 
aagacccgcc 
accccaattg 
gccctttcga 
gctgctggga 
cactaagctc 
tactgttgta 
acctgaatcg 
aaaacggggg 
acccagggat 
aggttttcac 
tcgtggtatt 
caagggtgaa 
ggatgagcat 
tttttcttta 
cattgagcaa 
acggtggtat 
ctccccccca 
ccccgatgct 
gcggaacggc 
ggccgctagc 
agaacgcaga 
acctgacccc 
tcccca tgcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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agagtaggga 
tcgttttatc 
ggatttgaac 
tgccaggcat 
aactcttttg 
cctgataaat 
tcgcccttat 
tggtgaaagt 
atctcaacag 
gcacttttaa 
aactcggtcg 
aaaagcatct 
gtgataacac 
cttttttgca 
atgaagccat 
tgcgcaaact 
ggatggaggc 
ttattgctga 
ggccagatgg 
tggatgaacg 
t^tcagacca 
aaaggatcta 
tttcgttcca 
tttttctgcg 
gtttgccgga 
agataccaaa 
tagcaccgcc 
ataagtcgtg 
cgggctgaac 
tgagatacct 
acaggtatcc 
gaaacgcctg 
ttttgtgatg 
tacggttcct 
attcattaat 



actgccaggc 
tgttgtttgt 
gttgcgaagc 
caaattaagc 
tttatttttc 
gcttcaataa 
tccctttttt 
aaaagatgct 
cggtaagatc 
agttctgcta 
ccgcatacac 
tacggatggc 
tgcggccaac 
caacatgggg 
accaaacgac 
attaactggc 
ggataaagtt 
taaatctgga 
taagccctcc 
aaatagacag 
agtttactca 
ggtgaagatc 
ctgagcgtca 
cgtaatctgc 
tcaagagcta 
tactgtcctt 
tacatacctc 
tcttaccggg 
ggggggttcg 
acagcgtgag 
ggtaagcggc 
gtatctttat 
ctcgtcaggg 
ggccttttgc 
gcaggtcacg 



atcaaataaa 
cggtgaacgc 
aacggcccgg 
agaaggccat 
taaatacatt 
tattgaaaaa 
gcggcatttt 
gaagatcagt 
cttgagagtt 
tgtggcgcgg 
tattctcaga 
atgacagtaa 
ttacttctga 
gatcatgtaa 
gagcgtgaca 
gaactactta 
gcaggaccac 
gccggtgagc 
cgtatcgtag 
atcgctgaga 
tatatacttt 
ctttttgata 
gaccccgtag 
tgcttgcaaa 
ccaactcttt 
ctagtgtagc 
gctctgctaa 
ttggactcaa 
tgcacacagc 
ctatgagaaa 
^gggtcggaa 
agtcctgtcg 
gggcggagcc 
tggccttttg 
atG 



11 

acgaaaggct 
tctcctgagt 
^gggtggcgg 
cctgacggat 
caaatatgta 
ggaagagtat 
gccttcctgt 
tgggtgcacg 
ttcgccccga 
tattatcccg 
atgacttggt 
gagaattatg 
caacgatcgg 
ctcgccttga 
ccacgatgcc 
ctctagcttc 
ttctgcgctc 
gtgggtctcg 
ttatctacac 
taggtgcctc 
agattgattt 
atctcatgac 
aaaagatcaa 
caaaaaaacc 
ttccgaaggt 
cgtagttagg 
tcctgttacc 
gacgatagtt 
ccagcttgga 
gcgccacgct 
caggagagcg 
ggtttcgcca 
tatggaaaaa 
ctcacatgtt 



cagtcgaaag 
aggacaaatc 
gcaggacgcc 
ggcctttttg 
t ccgctcatg 
gagtattcaa 
ttttgctcac 
agtgggttac 
agaacgtttt 
tgttgacgcc 
tgagtactca 
cagtgctgcc 
aggaccgaag 
tcgttgggaa 
tgtagcaatg 
ccggcaacaa 
ggcccttccg 
cggtatcatt 
gacggggagt 
actgattaag 
aaaacttcat 
caaaatccct 
aggatcttct 
accgctacca 
aactggcttc 
ccaccacttc 
agtggctgct 
accggataag 
gcgaacgacc 
tcccgaaggg 
cacgagggag 
cctctgactt 
cgccagcaac 
ctttcctgcg 



actgggcctt 
cgccgggagc 
cgccataaac 
cgtttctaca 
agacaataac 
catttccgtg 
ccagaaacgc 
atcgaactgg 
ccaatgatga 
gggcaagagc 
ccagtcacag 
ataaccatga 
gagctaaccg 
ccggagctga 
gcaacaacgt 
ttaatagact 
gctggctggt 
gcagcactgg 
caggcaacta 
cattggtaac 
ttttaattta 
taacgtgagt 
tgagatcctt 
gcggtggttt 
agcagagcgc 
aagaactctg 
gccagtggcg 
gcgcagcggt 
tacaccgaac 
agaaaggcgg 
cttccagggg 
gagcgtcgat 
gcggcctttt 
ttatcccctg 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4343 



<210> 9 

<211> 3888 

40 <212> DNA 

<213> Artificial Sequence 



45 



50 



55 



60 



<220> 

<223> Description of Artificial Sequence: pHL1920 



<400> 9 

cccaaaaaaa 

gggggggggg 

aaaaaaaaac 

aacagggtag 

cactataggg 

gacgagattt 

cgttgatata 

atgtacctat 

aaataagcac 

tccggaattc 

ttgttacacc 

cgacgatttc 

cctggcctat 

ggtgagtttc 

tttcaccatg 



aaaaaaaaaa 
ggggggggga 
aaagtttcgc 
ataatcactc 
cgaattgggt 
tcaggagcta 
tcccaatggc 
aaccagaccg 
aagttttatc 
cgtatggcaa 
gttttccatg 
cggcagtttc 
ttccctaaag 
accagttttg 
ggcaaatatt 



aaaaaaaaag 
cactttcgga 
ccggagtact 
actgagtgac 
accgggcccc 
aggaagctaa 
atcgtaaaga 
ttcagctgga 
cggcctttat 
tgaaagacgg 
agcaaactga 
tacacatata 
ggtttattga 
atttaaacgt 
atacgcaagg 



agtccagagt 
catctggtcg 
ggtcgacctc 
atccacatcg 
ccctcgaggt 
aatggagaaa 
acattttgag 
tattacggcc 
tcacattctt 
tgagctggtg 
aacgttttca 
ttcgcaagat 
gaatatgttt 
ggccaatatg 
cgacaaggtg 



ggccccgccg 
acctccagca 
cgaagttggg 
cgagcgcgcg 
cgacggtatc 
aaaatcactg 
gcatttcagt 
tttttaaaga 
gcccgcctga 
atatgggata 
tcgctctgga 
gtggcgtgtt 
ttcgtctcag 
gacaacttct 
ctgatgccgc 



ttccgcgccg 
tcgggggaaa 
ggggagtaga 
taatacgact 
gataagcttc 
gatataccac 
cagttgctca 
ccgtaaagaa 
tgaatgctca 
gtgttcaccc 
gtgaatacca 
acggtgaaaa 
ccaatccctg 
tcgcccccgt 
tggcgattca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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ggttcatcat 
gtactgcgat 
aacgcctggt 
aagcttgata 
5 gcggtggagc 
ggtaaagaaa 
gcgaaagata 
ggcgacgcgc 
cgcccggggc 
10 ccctcccggc 
gaagcatttt 
ggcgcgacct 
tcgccagaaa 
tgagcaaaag 
15 cataggctcc 
aacccgacag 
cctgttccga 
gcgctttctc 
ctgggctgtg 
20 cgtcttgagt 
aggattagca 
tacggctaca 
ggaaaaagag 
tttgtttgca 
25 ttttctacgg 
agattatcaa 
atctaaagta 
cctatctcag 
ataactacga 
30 ccacgctcac 
agaagtggtc 
agagtaagta 
gtggtgtcac 
cgagttacat 
35 gttgtcagaa 
tctcttactg 
tcattctgag 
aataccgcgc 
cgaaaactct 
40 cccaactgat 
^ aggcaaaatg 
ttcctttttc 
tttgaatgta 
caggatggcc 
45 ccctccgggc 
gagagcgttc 
tcgttttatt 
cggcgctacg 
tgccgccagg 
50 ggctgaaaat 



gccgtttgtg 

gagtggcagg 

gctacgcctg 

tcgaattcct 

tccagctttt 

aatacccttg 

tacctccccc 

gacacggaca 

cctctggtca 

cgcccgtgcc 

cggccggccc 

ctcgggccgc 

ggatcgtgac 

gccagcaaaa 

gcccccctga 

gactataaag 

ccctgccgct 

atagctcacg 

tgcacgaacc 

ccaacccggt 

gagcgaggta 

ctagaaggac 

ttggtagctc 

agcagcagat 

ggtctgacgc 

aaaggatctt 

tatatgagta 

cgatctgtct 

tacgggaggg 

cggctccaga 

ctgcaacttt 

gttcgccagt 

gctcgtcgtt 

gatcccccat 

gtaagttggc 

tcatgccatc 

aatagtgtat 

cacatagcag 

caaggatctt 

cttcagcatc 

ccgcaaaaaa 

aatattattg 

tttagaaaaa 

ttctgcttaa 

cgttgcttcg 

accgacaaac 

tgatgcctgg 

gcgtttcact 

caaattctgt 

cttctctcat 



atggcttcca 

gcggggcgta 

aataagtgat 

gcagcccggg 

gttcccttta 

attcttctaa 

ggggccggga 

cctgtcccca 

accccaggac 

acacgcaggg 

ctcctacgac 

acgcgcgctc 

ctgcattaat 

ggccaggaac 

cgagcatcac 

ataccaggcg 

taccggatac 

ctgtaggtat 

ccccgttcag 

aagacacgac 

tgtaggcggt 

agtatttggt 

ttgatccggc 

tacgcgcaga 

tcagtggaac 

cacctagatc 

aacttggtct 

atttcgttca 

cttaccatct 

tttatcagca 

atccgcctcc 

taatagtttg 

tggtatggct 

gttgtgcaaa 

cgcagtgtta 

cgtaagatgc 

gcggcgaccg 

aactttaaaa 

accgctgttg 

ttttactttc 

gggaataagg 

aagcatttat 

taaacaaaag 

tttgatgcct 

caacgttcaa 

aacagataaa 

cagttcccta 

tctgagttcg 

tttatcagac 

ccgccaaaac 



tgtcggcaga 

atttttttaa 

aataagcgga 

ggatccacta 

gtgagggtta 

taacccggcg 

ggtcgcgtca 

aaaacgccac 

acacgcggga 

ggccggcccg 

c^gggscacac 

agggagcgct 

gaatcagggg 

cgtaaaaagg 

aaaaatcgac 

tttccccctg 

ctgtccgcct 

ctcagttcgg 

cccgaccgct 

ttatcgccac 

gctacagagt 

atctgcgctc 

aaacaaacca 

aaaaaaggat 

gaaaactcac 

cttttaaatt 

gacagttacc 

tccatagttg 

ggccccagtg 

ataaaccagc 

atccagtcta 

cgcaacgttg 

tcattcagct 

aaagcggtta 

tcactcatgg 

ttttctgtga 

agttgctctt 

gtgctcatca 

agatccagtt 

accagcgttt 

gcgacacgga 

cagggttatt 

agtttgtaga 

ggcagtttat 

atccgctccc 

acgaaaggcc 

ctctcgcatg 

gcatggggtc 

cgcttctgcg 

agaagctagc 



atgcttaatg 

ggcagttatt 

tgaatggcag 

gttctagagc 

attgcgcgca 

gcccaaaatg 

ccgaccacgc 

catcgcagcc 

gcagcgccgg 

tgtctccaga 

gagggaccga 

ctccgactcc 

ataacgcagg 

ccgcgttgct 

gctcaagtca 

gaagctccct 

ttctcccttc 

tgtaggtcgt 

gcgccttatc 

tggcagcagc 

tcttgaagtg 

tgctgaagcc 

ccgctggtag 

ctcaagaaga 

gttaagggat 

aaaaatgaag 

aatgcttaat 

cctgactccc 

ctgcaatgat 

cagccggaag 

ttaattgttg 

ttgccattgc 

ccggttccca 

gctccttcgg 

ttatggcagc 

ctggtgagta 

gcccggcgtc 

ttggaaaacg 

cgatgtaacc 

ctgggtgagc 

aatgttgaat 

gtctcatgag 

aacgcaaaaa 

ggcgggcgtc 

ggcggatttg 

cagtctttcg 

gggagacccc 

aggtgggacc 

ttctgattta 

ggccgatc 



aattacaaca 

ggtgccctta 

aaa ttcgtcg 

ggccgccacc 

ggcctagcta 

ccgactcgga 

cgccggccca 

acacacggag 

gccggggacg 

gcgggagccg 

aggccggcca 

gcacggggac 

aaagaacatg 

ggcgtttttc 

gaggtggcga 

cgtgcgctct 

gggaagcgtg 

tcgctccaag 

cggtaactat 

cactggtaac 

gtggcctaac 

agttaccttc 

cggtggtttt 

tcctttgatc 

tttggtcatg 

ttttaaatca 

cagtgaggca 

cgtcgtgtag 

accgcgagac 

ggccgagcgc 

ccgggaagct 

tacaggcatc 

acgatcaagg 

tcctccgatc 

actgcataat 

ctcaaccaag 

aacacgggat 

ttcttcgggg 

cactcgtgca 

aaaaacagga 

actcatactc 

cggatacata 

ggccatccgt 

ctgcccgcca 

tcctactcag 

actgagcctt 

acactaccat 

accgcgctac 

atctgtatca 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3888 
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<211> 12 

55 <212> RNA 

<213> Influenza A virus 
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<210> 11 
<211> 12 
<212> RNA 

<213> Influenza B virus 

5 

<400> 11 
nnygcuucug cu 

10 <210> 12 
<211> 12 
<212> RNA 

<213> Influenza C virus 
15 <400> 12 

ccugcuucug cu ]_2 

<210> 13 

20 <211> 12 

<212> RNA 

<213> Artificial Sequence 
<220> 

25 <223> Description of Artificial Sequence: Modified 
influenza A 3* sequence (pHL1104 and 1920) 



30 



40 



55 



<400> 13 

ccuguuucua cu X2 



<210> 14 

<211> 12 

<212> RNA 

35 <213> Artificial Sequence 
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<223> Description of Artificial Sequence: Modified 
influenza A 3' sequence (pHL1948) 

<400> 14 

ccucguucuc cu 12 



45 <210> 15 
<211> 13 
<212> RNA 

<213> Artificial Sequence 
50 <220> 

<223> Description of Artificial Sequence: Modified 
influenza A 5' sequence (pHL1920) 
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agaagaauca agg 23 
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aguagaaaca agg 13 

<210> 17 
<211> 13 
<212> RNA 

<213> Influenza B virus 
<400> 17 

aguagwaaca rnn 13 



15 <210> 18 
<211> 13 
<212> RNA 

<213> Influenza C virus 
20 <400> 18 

agcaguagca agr 13 



<210> 19 
25 <211> 21 
<212> RNA 

<213> Influenza A virus 
<400> 19 

30 aguagaaaca aggnnnuuuu u 21 

<210> 20 

<211> 21 

35 <212> RNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Modified 
40 influenza A 5 '-sequence {pHLl920) 

<400> 20 

agaagaauca aggnnnuuuu u 21 
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<2ia> 21 

<211> 21 
<212> RNA 

<213> Influenza B virus 
<400> 21 

aguagwaaca rnnnnnuuuu u 21 

55 <210> 22 
<211> 19 
<212> RNA 

<213> Artificial Sequence 
60 <220> 

<223> Description of Artificial Sequence: Modified 
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influenza C 5 ' sequence 
<400> 22 

aguaguaaca agrguuuuu 19 

<210> 23 
<211> 15 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified 
influenza A 3' sequence (pHL1104 and 1920) 

<400> 23 

nnnccucuuu cuacu 15 
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<223> Description of Artificial Sequence: Modified 
influenza A 3* sequence {pHL1948) 

<400> 24 

nnnccucguu cuccu 15 
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<223> Description of Artificial Sequence: Modified 
influenza B 3* sequence 
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Sta bl e Recombinant Infl ue nza Vir uses Free of Hel per Viruses 

5 Field of the Invention 

The invention relates to a recombinant Influenza virus for high-yield 
expression of incorporated foreign gene(s), which is genetically stable In 
the absence of any helper virus; a method for the production of said 
recombinant Influenza virus, a method for constructing specific ribozyme- 
10 sensitive influenza carrier strains; pharmaceutical compositions comprising 
said recombinant Influenza viruses; and the use of said recombinant 
influenza virus for preparing medicaments for vaccination purposes. 

Technical Background 

15 Redesigning Influenza virus Into a vector system for expression of foreign 
genes similar to what has been achieved In several other thoroughly 
studied viruses such as adenovirus, retrovirus, Semliki Forest virus or 
Rabies virus has the advantage of an industrially well established mode of 
cheap propagation for influenza in fertilized chicken eggs leading to rather 

20 high titers (above 10^°/ml). On the other hand none of Its genes may be 
deleted from the influenza genome according to our present knowledge, 
and give room for large-size foreign Insertions. Only small fragments of 

. foreign polypeptide chains such as B cell epitopes (10 to 15 amino acids) 
may be inserted into selected positions within two of the viral proteins. I.e. 

25 in exchange for one of the variable antigenic regions located at the surface 
of hemagglutinin (Muster et al., Mucosal model of immunization against 
human immunodeficiency virus type 1 with a chimeric influenza virus. J. 
Virol. 69 (11), 6678-6686 (1995)), or into the stalk sequence of viral 
neuraminidase (Garcla-Sastre and Palese, The cytoplasmic tail of the 

30 neuraminidase protein of Influenza A virus does not play an important role 
in the packaging of this protein into viral envelopes. Virus Res. 37, 37-47 
(1995)), and be stably maintained as functional fusion proteins. 
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Constructs of this kind turned out to be useful for experimental 
vaccination in a few cases studied, but only rather few clearly defined 
epitope sequences (of ten to twelve amino acids each) are known today, 
and some of them might also be misfolded within such restricted fusion 
5 protein positions, or in other cases interfere with the correct tertiary 
structure and function of their host polypeptide chains. 
Incorporation of a full-size foreign protein Into influenza via reverse 
genetics, encoded by an independent ninth vRNA molecule in addition to 
its regular set of eight standard vRNA segments is without special 

10 provisions only transiently possible (Luytjes et al.. Amplification, 
expression, and packaging of a foreign gene by influenza virus. Cell 59, 
1107-1113 (1989); Enami et al.. An influenza virus containing nine 
different RNA segments. Virology 185, 291-298 (1991)). In the absence of 
a continuous selective pressure any additional recombinant vRNA segment 

15 cannot be stably maintained as long as a wildtype promoter sequence is 
used on that ninth vRNA segment, and it will inadvertently be lost after 
few steps of propagation. 

Using a different system of influenza reverse genetics developed in our 
laboratory (Zobel et al., RNA polymerase I catalysed transcription of Insert 

20 viral cDNA. Nucleic Acids Res. 21, 3507-3614 (1993); Neumann et al., 
RNA polymerase I-mediated expression of influenza viral RNA molecules. 
Virology 202, 477-479 (1994)), which was built around in vivo synthesis 
of recombinant vRNA molecules by cellular RNA polymerase I transcription 
of the respective template cDNA constructs, promoter-up mutations have 

25 been designed by nucleotide substitutions (Neumann and Hobom, 
Mutational analysis of influenza virus promoter elements in vivo, 3. Gen. 
Virol. 76, 1709-1717 (1995)). When these are attached to a recombinant 
ninth vRNA segment its increased transcription and amplification rate will 
not only compensate for the losses suffered spontaneously, but even 

30 cause accumulation of the foreign vRNA segment during simple viral 
passaging, in the absence of any selection. However, due to its over- 
replication relative to all of the regular influenza vRNA segments (which of 
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course are connected to wild-type promoter sequences) after catching up 
with the others the foreign segment will become over-abundant. This 
increasingly will result in viral particles that have incorporated several 
copies of recombinant vRNA, but no longer have a full set of all eight 
5 standard segments incorporated among an average of about 15 vRNA 
molecules present within a virion. Such particles are defective and will not 
result in plaque formation, hence after an initial increase of recombinant 
viral particles during the first steps of propagation a dramatic decrease is 
observed, usually at the third or fourth step of viral passaging, depending 

10 on the size of the recombinant vRNA and the level of the promoter-up 
mutation attached. A balanced situation with regard to the insert length 
and the level of promoter activity can be achieved, and has been 
propagated in a particular case over 11 passages, with essentially stable 
levels of recombinant viruses among a majority of helper viruses (around 

15 80%) during these steps. If a full-level promoter-up mutation is used 
(1104 or the new variant 1920, see below) a balanced-level propagation is 
reached in conjunction with a recombinant vRNA size of 4000 nucleotides 
(Maysa Azzeh, Ph.D. Thesis, Univ, Giessen (2000)). 

In all of these preparations, both in transiently achieved increased yields 
20 (up to 40% of recombinants after three or four steps of viral passage), 
and in a balanced propagation of recombinant influenza viruses (10 - 
20%) the respective viral progeny inadvertantly constitute mixtures with a 
majority of non-recombinant helper viruses. These result both from a 
statistical mode of packaging vRNA molecules into a virion (the ninth 
25 segment may not be co-packaged), and from the fraction of cells solely 
infected by helper virus. 

The problems of fractional yields and of instability during viral propagation 
of recombinant influenza are the problems to be solved with the present 
invention. 



30 
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Summary of the Invention 

Starting out from two observations In this laboratory which are discussed 
below and which concern two hitherto unsuspected properties of influenza 
viral RNA polymerase in its Interaction with terminally adapted influenza- 
5 specific RNA molecules, a new technique for the construction of stable 
recombinant influenza viruses was found. 

As previously described in WO 96/10641 plasmid constructs are designed 
to generate influenza vRNA-like molecules in vivo by cellular RNA 
polymerase I transcription following plasmid DNA transfection into tissue 

10 culture cells, and to this end contain flanking rDNA promoter and 
terminator elements, externally located relative to any cDNA constructs in 
between. The resulting recombinant vRNA molecules are designed to 
contain 5' and 3' recognition sequences for influenza viral RNA 
polymerase, which however carry up to five nucleotide substitutions (in 

15 promoter-up mutant pHL1920) resulting in a substantial increase of 
expression over wildtype influenza promoter levels. While recombinant 
pseudoviral RNA is initially transcribed by RNA polymerase I, further 
amplification and mRNA transcription depends on the presence of viral 
RNA polymerase and viral nucleoprotein in the cell, which generally are 

20 provided by infection of a helper virus. As a consequence the progeny viral 
yield will constitute a mixture of recombinant viruses together with a 
majority of wild-type helper viruses. 

In the new technique the recombinant vRNA-like molecules as transcribed 
by RNA polymerase I are constructed as ambisense RNA double segments, 

25 with one reading frame (an influenza gene) in sense and a second one (a 
foreign gene) in anti-sense orientation, or vice versa. In such constructs 
both reading frames will be expressed via the cap-snatching mode, even if 
at different levels. Again, infection by helper virus is required to provide 
the necessary viral early and late proteins for genetic expression and 

30 virion packaging. However, the particular helper virus used in the new 
method is a recombinant virus carrying 2x2 specifically designed ribozyme 
targets inserted into the flanking non-coding sequences of one of its eight 
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vRNA segments. The viral segment chosen to become rfbozyme-sensitive 
is always identical to the one used in constructing the recombinant 
amblsense RNA molecule, as t,he viral carrier gene in covalent linkage with 
an additional foreign gene. 

Recombinant Influenza viruses produced in this way through RNA 
polymerase I transcription of an ambisense viral RNA molecule followed by 
infection with that specifically designed type of ribozyme-sensltive helper 
virus will carry one of the influenza genes twice, once within that 
ribozyme-sensltive helper vRNA segment, and a second time within the 
ribozyme-resistant amblsense segment. Recombinant viruses of this type 
are again obtained initially only as a mixture together with a majority of 
non-recombinant helper viruses. A progeny viral passage through tissue 
culture cells (293T) which before have been transiently transfected with 
plasmid constructs expressing the respective double-headed ribozyme will 
(in one step) inactivate the ribozyme-sensltive segment by a factor of up 
to 100. One or two rounds of such ribozyme treatment in vivo will at the 
same time (a) purify the recombinant virus from its non-recombinant 
helper contaminants, and (b) delete the sensitive vRNA helper segment 
from within the initial (additive) recombinant virus. 

As a result recombinant influenza viruses are isolated along this several- 
step procedure, which are free of contaminating helper viruses and carry 
seven regular and one ambisense vRNA segments, all in a balanced 
replication mode. Their recombinant nature is stably maintained because 
of a covalent junction between one of the viral genes and the full-size 
foreign gene inserted, a situation achieved here for the first time, via 
constructing an influenza ambisense RNA segment. The whole procedure 
is independent of any (selectable) phenotype, and can be applied to either 
of the eight influenza vRNA segments. After establishing a first ambisense 
vRNA segment carrying a single foreign gene it can also be repeated all 
over for inserting a second foreign gene within another ambisense RNA 
segment of the same constitution in principle. 
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Stable recombinant viruses of the type described can be used for cheap 
propagation In fertilized eggs, either for production of those recombinant 
viruses themselves or for production of foreign proteins or glycoproteins 
encoded by them, and hence find application in (glyco)protein production 
5 or in providing vector systems for somatic gene therapy or In being used 
as vaccination agents. 

Thus, the present invention provides 

(1) a recombinant influenza virus for high-yield expression of incorporated 
10 foreign gene(s), which is genetically stable in the absence of any helper 

virus and which comprises at least one viral RNA segment being an 
ambisense RNA molecule (hereinafter "ambisense RNA segment") and 
containing one of the standard viral genes in sense orientation and a 
foreign, recombinant gene in anti-sense orientation, or vice versa^ in 
15 overall convergent arrangement; 

(2) a preferred embodiment of the recombinant influenza virus defined in 
(1) above, wherein one or more of the regular viral RNA segments, 
differing from said at least one ambisense RNA segment (hereinafter 
"modified regular segment"), comprises a vRNA encoding a foreign gene, 

20 preferably one or more of the regular viral RNA segments has (have) been 
exchanged for a vRNA encoding a foreign gene; 

(3) a preferred embodiment of the recombinant influenza virus defined in 
(1) and (2) above, in which the terminal viral RNA sequences of said one 
or more modified regular segments and/or of said at least one ambisense 

25 RNA segment, which are active as the promoter signal, have been 
modified by nucleotide substitutions in up to five positions, resulting in 
improved transcription rates of both the vRNA promoter as well as the 
cRNA promoter as present in the complementary sequence; 

(4) a method for the production of recombinant influenza viruses as 
30 defined in (1) to (3) above comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, in 
antisense, sense or ambisense design, 
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(b) followed by infection with an influenza carrier strain constructed to 
include flanking ribozyme target sequences in at least one of its viral RNA 
segments, and 

(c) thereafter selective vRNA inactivation through ribozyme cleavage; 

5 (5) a method of constructing influenza carrier strains carrying one or more 
ribozyme target sites (of type one) in vRNA flanking positions comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, carrying 
two different 3' promoter sequences In tandem, which are separated by a 
second type of ribozyme target sequence, and which carry the said 

10 internal ribozyme target sites of type one; 

(b) followed by infection of an Influenza wildtype strain; 

4mh (c) thereafter amplification through simple steps of viral propagation; and 

(d) finally isolation through removal of their external 3* promoter 
sequence by ribozyme cleavage through Infection of cells expressing 

15 ribozyme type 2, followed by plaque purification, said method being 
suitable for the construction of an influenza carrier strain required for step 
(b) of (4) above; 

(6) a ribozyme-sensitlve influenza carrier (helper) strain obtainable by the 
method of (5) above; 
20 (7) a pharmaceutical composition comprising a recombinant influenza 
virus as defined In (1) to (3) above; 

(8) the use of a recombinant influenza virus as defined in (1) to (3) above 
" ^ for preparing a medicament for vaccination purposes; 

(9) the use of a recombinant influenza virus as defined In (1) to (3) above 
25 for preparing agents for somatic gene therapy; 

(10) the use of a recombinant influenza virus as defined in (1) to (3) 
above for preparing agents for transfer and expression of foreign genes 
into cells (abortively) infected by such viruses; 

(11) the use of a recombinant influenza virus as defined in (1) to (3) 
30 above for preparing agents for transfer and expression of RNA molecules 

into cells infected by such viruses; 
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(12) a method for the production of proteins or glycoproteins which 
comprises utilizing a recombinant Influenza virus as defined in (1) to (4) 
above as expression vector; 

(13) a method for preventing and/or treating influenza which comprises 
5 administering a recombinant influenza virus as defined in (1) to (3) above 

to the mammal to be treated, i.e., a vaccination method utilizing said 
recombinant virus; 

(14) a method for somatic gene therapy, which method comprises 
subjecting the organism to be treated with a recombinant influenza virus 

10 as defined in (1) to (3) above; 

(15) a method for transfer and expression of foreign genes into ceils, and 
for transfer and expression of RNA molecules into cells, which method 
comprises infecting the cells with a recombinant Influenza virus as defined 
in (1) to (3) above; 

15 (16) use of a recombinant influenza virus as defined in (1) to (3) above 
for preparing agents for autologous immunotherapy; 

(17) a method for an immunotherapy which comprises ex vivo infection of 
immune cells with a recombinant influenza virus as defined In (1) to (3) 
above, and introduction of the transduced cells into the patient; and 
20 (18) a method for the induction of antibodies which comprises utilizing a 
recombinant influenza virus as defined in (1) to (3) above as an 
immunogen. 

Brief Description of tlie Figures 

25 Fig.l shows 3' nucleotide extensions of influenza vRNA template 
molecules. 

Fig. 2 shows propagation of recombinant influenza viruses with tandem 
bicistronic vRNA, 

Fig. 3 shows tandem bicistronic vRNA supporting an alternative mode of 
30 transcription and replication initiation 

Fig. 4 shows the outgrowth of promoter-up recombinant vRNA versus 
wildtype vRNA segments in stepwise propagation of influenza virus. 
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Fig. 5 shows pAM403 ribozyme cleavage of pHL2969 derived vRNA 
molecules at specific target sites inserted between an external and an 
Internal 3' promoter sequencp. 

Fig. 6 shows comparative cleavage analysis of model CAT vRNAs with 
5 tandem target sites in various flanking positions, by target-specific 
ribozymes. 

Fig. 7 shows an alignment of pAI^424 double-headed ribozyme with one of 
their repetitive target sequences located within the 5' and 3' vRNA flanking 
regions. 

10 Fig. 8 shows pAM424 ribozyme cleavage of resistant FPV wild-type HA 
vRNA and ribozyme-sensitive pHL2969-derived HA-vRNA in 293T cells. 
Fig. 9 shows a functional analysis of the influenza cRNA promoter 
structure. 

Fig. 10 shows a functional analysis of the vRNA and cRNA promoter in 
15 ambisense arrangement. 

Fig. 11 shows a basepair substitution according to the vRNA 'corkscrew' 
structure. 

Fig. 12 shows a flow-chart of the isolation procedure for an ambisense 

recombinant influenza virus. 
20 Fig. 13 shows immuno-electron microscopy of purified influenza 

FPV/CSFV-E2-HA virions. 

Fig. 14 shows vector pHL2969. 

Fig. 15 shows vector pAM403. 

Fig. 16 shows vector pAM424. 
25 Fig. 17 shows vector pHL2507. 

Fig. 18 shows vector pHL2583. 

Fig. 19 shows vector pHL2989. 

Fig. 20 shows vector pHL1920. 

30 Detailed Description of the Invention 

According to the present invention "influenza virus" embraces influenza A 
virus, influenza B virus and influenza C virus, with influenza A virus being 
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preferred. A "mammal" according to the present Invention Includes 
humans and animals. "Organism" embraces prokariotic and eukariotic 
systems as well as multicellular systems such as vertebrates (including 
mammals) and Invertebrates. "Infected cells" and "Infecting cells" 
5 according to the present Invention also include "abortively infected cells" 
and "abortively infecting cells", respectively. 

In a preferred influenza virus according to embodiment (1) at least one of 
the regular viral RNA segments Is replaced by an ambisense RNA segment 

10 which contains one of the standard viral genes in sense orientation and a 
foreign, recombinant gene in anti-sense orientation or vice versa in overall 
convergent arrangement. It is moreover preferred that in the ambisense 
RNA molecule said foreign recombinant gene is covalently bound to one of 
the viral genes while the original vRNA segment coding for the same gene 

15 is deleted from the recombinant virus by a specific ribozyme cleavage. 

The foreign gene(s) in ambisense covalent junction with the viral gene(s) 
preferably code for proteins and/or glycoproteins which are secreted from 
cells infected with the recombinant virus, such as lymphokines, or code for 

20 glycoproteins that are Incorporated into the virion as well as the plasma 
membrane of the infected cell. In another preferred embodiment the 
foreign gene(s) In ambisense covalent junction with the viral gene(s) code 
for proteins or artificial polypeptides designed to support an efficient 
presentation of inherent epitopes at the surface of infected cells, for 

25 stimulation of B cell and/or T cell response. Such proteins or artificial 
polypeptides constitute for instance a tumor antigen or an artificial 
ollgomeric series of T cell epitopes. Finally, the foreign gene(s) may be 
suitable for transfer and expression of RNA molecules, including antisense 
RNAs and double stranded RNAs, into cells. Such recombinant influenza 

30 viruses are suitable for sequence specific gene silencing, for example by 
antisense or RNA interference mechanisms. 
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A preferred recombinant virus of embodiment (2) of the invention is where 
in the regular viral RNA segments one or both of the standard 
glycoproteins hemagglutinin and neuraminidase have been exchanged, 
preferably into foreign glycoprotein (s) or into fusion glycoproteins 
5 consisting of an anchor segment derived from hemagglutinin and an 
ectodomain obtained from the foreign source, viral or cellular, or in which 
such recombinant glycoprotein has been inserted as a third molecular 
species in addition to the remaining standard components. 

10 As set forth in embodiment (3) above, a preferred recombinant virus of 
the invention is where the terminal viral RNA sequences, which are active 
as promoter signal, have been modified by nucleotide substitution in up to 
5 positions, resulting in improved transcription rates (of both the vRNA 
promoter and in the cRNA promoter as present in the complentary 

15 sequence) as well as enhanced replication and/or expression rates relative 
to the wild-type sequence. Said modified terminal viral RNA sequences 
differ from the wild-type sequence In that they are containing at least one 
segment (a naturally occurring segment or an additional segment) 
wherein the 12 nucleotide conserved influenza 3' terminal sequence has 
20 been modified by replacement of one to three nucleotides occurring in said 
sequence at positions 3, 5 and 8 relative to the 3' end by other 
nucleotides provided that the nucleotides introduced in positions 3 and 8 
are forming a base pair (i.e., if the nucleotide position 3 is G, than that in 
position 8 is C; if the nucleotide in position 3 is C, than that in position 8 is 
25 G; etc.). 

The 3' conserved regions of the wild-type influenza virus have the 
following sequences: 
Influenza A: 5'-CCUGCUUUUGCU-3' 
30 Influenza B: 5*-NN(C/U)GCUUCUGCU-3' 
Influenza C: 5'-CCUGCUUCUGCU-3'. 
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Moreover, the 13 nucleotide conserved influenza 5'-termlnal sequence 
may be modified by replacement of one or two nucleotides occurring in 
said sequence as positions 3 and 8 by other nucleotides, again provided 
that the introduced nucleotides are forming a base pair. The 5' conserved 
regions of the wild-type influenza virus have the following sequences: 
Influenza A: 5'-AGUAGAAACAAGG 
Influenza B: 5'-AGUAG(A/U)AACA(A/G)NN 
Influenza C: 5'-AGCAGUAGCAAG(G/A) : 

Preferred influenza viruses of the invention are those wherein in the 3' 
conserved region the replacements G3A and C8U have been performed, 
more preferred are those where also the replacement U5C has been 
performed (the above mutations are relative to the 3' end; such counting 
from the 3' end is also indicated by a line on top of the digit, e.g., G 3A). 
Another preferred Influenza virus mutant comprises the 3'-terminal 
nucleotide sequence G3C, U5C and C8G (relative to the 3' end) giving the 
following 3' terminal nucleotide sequence 5'-CCUCGUUCUCCU-3'. Among 
the influenza viruses defined hereinbefore those having a 3*-terminal 
nucleotide sequence of 5'-CCUGUUUCUACU-3' are most preferred. In case 
of an influenza A virus the segment may further have the modifications 
U3A and A8U in its 5* terminal sequence, in case of influenza C it may 
have the modifications C3U and G8A in its 5' terminal sequence. The most 
preferred influenza viruses of the present invention comprise the following 
general structures: 
Influenza A (mutant DHL 1104'>: 

S'-AGUAGAAACAAGGNNNUs-s.. (880-2300 ntds)..N'N'N*CCUCUUUCuACU-3' 

Influenza A (mutant pHL 1920^: 

5'-AGAAGAAUcAAGGNNNUs-6.. (880-2300 ntds)..N'N'N'CCUCUUUCuACU-3' 
Influenza A (mutant pHL 1948'>: 

5'-AGUAGAAACAAGGNNNU5-6.. (880-2300 ntds)..N*N*N'CCUCGUUCuCCU-3' 
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Influenza B: 

5'-AGUAG(A/U)AACA(A/G)NNNNNU5-6..(880-2300 ntds)..N'N'N*N'N'(C/U)GUuuCuACU-3* 

Influenza C: 

5'-AGUAGUAAcAaG(G/A)GU5.6..(880-2300 ntds)..CCCCUGUUUCuACU-3' 

In the above structures the variables are defined as follows: 

(1) Underlined and enlarged letters show the required mutations relative 
to the wild-type sequence for preparing a promoter mutant with enhanced 
properties; 

(2) enlarged A in the 5'-part of the sequence: additional A (position 10), 
angle-forming; 

(3) (A/G) at one position: different isolates or single segments with 
different sequence at this respective positions which are analytically 
interchangeable; 

(4) N and N': undefined, but base paired positions relative to each other in 
complementarity between the 5' and 3' termini, different among the 8 
segments, but constant for each segment throughout all viral isolates; 

(5) (880-2300 ntds): the lengths of the virus segments, in case of 
segments with foreign genes increased up to 4,000 nucleotides. 

The pharmaceutical composition according to embodiment (7) above and 
the medicament of embodiment (8) above contains the recombinant 
influenza virus in a pharmaceutically effective amount. Besides said 
recombinant influenza virus, the pharmaceutical composition and the 
medicament may contain further pharmaceutically acceptable carrier 
substances well-known to a person skilled in the art, such as binders, 
desintegrants, diluents, buffers, preservatives, etc. The pharmaceutical 
composition and medicaments are solid or liquid preparations and are 
suitable to be administered orally, intravenously or subcutaneously. 

The medicament according to embodiment (8) above is preferably suitable 
as a medicament against influenza and/or against other infections. The 
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recombinant influenza virus may be present in form of inactivated 
preparations or may be present in form of live recombinant viruses, 
preferably as attenuated viruses. 

5 Live recombinant viral vaccines, live but attenuated recombinant viral 
vaccines or inactivated recombinant viral vaccine can be formulated. 
Inactivated vaccines are "dead" in the sense that their infectivlty has been 
destroyed. Ideally, the infectivity is destroyed without affecting its 
immunogeniclty. To prepare inactivated vaccines, the recombinant virus 
10 may be grown in cell cultures or in embryonated chicken eggs, purified, 
and inactivated by formaldehyde or B-propiolactone. The resulting vaccine 
is usually administered intramuscularly. 



Inactivated viruses may be formulated with a suitable adjuvant to 
15 enchance the immunological response. Such adjuvants may include, but 
are not limited to mineral gels, e.g., aluminum hydroxide, surface-active 
substances such as pluronic polyols, iysolecithin, peptides, oil emulsions, 
and potential useful human adjuvants such as BCG. 



20 Many methods may be used to introduce the vaccine formulations above, 
for example the oral, intradermal, intramuscular, intraperitoneal, 
subcutaneous, or intranasal routes. Where a live recombinant virus 
vaccine is used, it may be preferable to introduce the formulation via the 
natural route of infection for influenza virus. 

25 

The medicament according to embodiment (8) above is preferably suitable 
for prophylactic or therapeutic vaccination, or both, against influenza and 
other infections. For example, recombinant viruses can be made for use in 
vaccines against HIV, hepatitis B virus, hepatitis C virus, herpes viruses, 
30 papilloma viruses, to name but a few. In one embodiment the 
recombinant virus contains the genes for surface proteins of the viruses, 
in another the genes for non-structural or regulatory genes. The 
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recombinant viruses may be present In form of inactivated preparations or 
may be present in form of live recombinant viruses, or in live, but 
attenuated viruses. In an attenuated virus the recombinant virus would go 
through a single or very few' propagation cycle(s) and induce a sufficient 
level of Immune response, but would not cause disease. Such viruses lack 
one of the essential influenza genes or contain mutations to introduce 
temperature sensitivity. 

The agents of embodiments (9)-(ll) of the invention are applicable in ex 
vivo and in vivo application schemes. The RNA molecule to be expressed 
by means of the agent of the embodiment (11) is of an antisense 
sequence or double strand sequence (in ambisense bidirectional 
transcription) relative to a target cellular mRNA molecule. In embodiment 
(11) the agent is preferably suitable for sequence-specific gene silencing, 
preferably by antisense RNA or RNA interference mechanisms. 

The method for the production of proteins or glycoproteins is preferably 
performed in cell culture cells or in fertilized chicken cells in accordance 
with standard techniques within the general knowledge of a person skilled 
in the art. The proteins or glycoproteins to be expressed are those 
incorporated into the ambisense construct as defined hereinbefore. 

The methods according to embodiments (13) to (15), (17) and (18) of the 
invention include the administration of an effective amount to the mammal 
or the administration of a sufficient infective dose of the recombinant virus 
to the cell system that is used for ex vivo therapy or for in vitro 
investigations, whereby the amount and dose will be determined by a 
parson skilled in the respective arts or knowledgeable of the desired 
treatments. 

The agent of embodiment (16) of the invention is preferably utilized to 
infect, transfect or transduce patient-derived immune cells. The agent is 
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suitable for treatment of cancer or chronic viral infections. For this 
purpose, patient derived immune cells, preferably dendritic cells, are ex 
vivo infected with recombinant influenza viruses expressing, e.g., tumor 
antigens or viral antigens. The transduced cells are then reintroduced into 
the patient. 



The preferred method for immunotherapy of embodiment (17) of the 
invention is an autologous immunotherapy, wherein the cells which are ex 
vivo infected are patient-derived and the transduced cells are reintroduced 
into the patient. The diseases to be treated by this method include cancer 
and chronic viral Infections. For details regarding such treatment see 
discussion of embodiment (16) above. 

The method for inducing antibodies according to embodiment (18) of the 
invention is suitable for inducing antibodies to foreign proteins including 
glycoproteins, following the administration of protein or glycoprotein 
antigens as part of a recombinant influenza virus in an authentic 
conformation, whereby the virus is purified by gentle procedures based on 
hemagglutination, and the gene is expressed at high rates In the Infected 
cells. 

As Influenza viruses have a wide host range, recombinant influenza 
viruses can be used to obtain strong immune responses in, and isolate 
antibodies from, a wide range of animals, including, but not limited to, 
fowl, pigs, goats, horses, and mice. Further, influenza viruses adapted to 
the mouse can be used for the infection of mice, by several routes 
including the intranasal route. This results in infection of the pharyngeal 
mucosal cells and results in an additional type of B cell response (e.g., as 
recognized in the ratio of IgG to IgA). Mice are of particular utility in the 
induction of immune responses in transgenic mice that have been 
engineered to express human antibodies. As gentle procedures based on 
hemagglutination are used to purify influenza viruses, antibodies to 
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antigens in native conformation can be isolated from the infected 
mammals- 

Further preferred embodiments of the invention are set forth herein 
5 below. 

A. Construction of influenza helper virus strains carrvino ribozvme 
target sequences in flanking positions within either of the vRNA 
segments 

10 

A.l: Influenza RNA polymerase will initiate transcription and replication 
from promoter structures located at internal positions in an RNA molecule, 
not only from the natural position at both ends of a vRNA molecule: 
This is true in particular for promoter-up variants in RNA-lnternal location 

15 due to their enhanced binding affinity for viral polymerase. Not only 3' end 
extensions are tolerated In RNA-lnternal promoter recognition (Fig. 1), but 
also 5' extensions as well as extensions at both ends of the RNA template 
molecule, containing noncomplementary as well as complementary 
sequence, i.e. potentially present as a double-stranded extension. Finally, 

20 also an extension by way of duplication of the promoter sequence (active 
or inactivated) leads to mRNA transcription and CAT expression, initiated 
from the active pair of 5' and 3' promoter halves, irrespective if in external 
or in internal or even in an oblique localization (active 5' promoter 
sequence in external, active 3' promoter sequence In internal position, or 

25 vice versa). RACE-determination of the resulting 5' and 3' ends of viral 
mRNA and cRNA, i.e. the products of transcription and replication 
reactions for several of the extended template vRNA constructs proves an 
exact recognition and sequence-specific initiation at a position equivalent 
to regular 3' position 1: all of the various template extensions are lost in 

30 every product RNA molecule, most likely after only one round of 
replication. 
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A-2: Bicistronic (tandem) vRNA molecules carrying an additional 
3* specific promoter sequence in a central position between its two genes 
can be used for an indirect 'selection method for recombinant influenza 
5 viruses: 

The method described is applicable for any foreign gene (e.g. CAT) 
without a selection potential of its own, if inserted into the distal mRNA 
position (proximal vRNA position, in anti-sense orientation) behind a 
carrier gene (e.g. GFP) in the proximal mRNA position, able to serve as a 

10 transient selection marker. The carrier gene which is used for selection will 
get lost spontaneously during further propagation. These constructs are 
equivalent to a 3' extension of the template vRNA by a full-size gene of 
750 nucleotides up to a second 3' promoter sequence, in terminal location- 
While in the set of experiments shown in Fig. 2 the external 3' promoter 

15 sequence is maintained throughout as the same promoter-up variant 
(1104), the internal 3' promoter sequence has been varied to include a 
full-level promoter-up variant (pHL2270, containing promoter mutant 
1104), a medium-level promoter variant (pHL2350, containing promoter 
mutant 1948), a wildtype promoter construct (pHL2629), and a construct 

20 carrying an unrelated central sequence in an otherwise identical design 
(pHL2300). 

Due to the presence of two 3V promoter sequences in conjunction with a 
single 5' promoter sequence an alternating interaction between them will 
constitute one or the other active promoter structure (see Fig. 3). While 

25 the external location with an adjacent RNA 3' end may have a structural 
advantage, this appears to be compensated by the shorter distance in an 
interaction between the 5' sequence and the central 3' sequence in 
constituting the internal promoter, such that the competition between the 
two primarily reflects the various internal 3' promoter allele sequences 

30 used, compare Fig. 2B and activity ratios indicated above and below the 
lanes. Translation of the mRNA-distal gene (CAT) is only observed 
following an internal initiation at the bicistronic vRNA template, resulting in 



wo 00/53786 PCT/EPOO/01903 

19 

a spontaneous deletion of the mRNA-proximal (vRNA-distal) gene, GFP, 
compare right half of Fig. 3. In complementary analyses GPP fluorescence 
is observed initially for all of the biclstronic constructs, but gets lost on a 
faster rate from pHL2270 transfected and influenza infected cells (not 
5 shown), and will stay unchanged in pHL2300-treated cells. The indirect 
selection system based on bicistronic (tandem) molecules as designed 
here and demonstrated for reporter genes GPP and CAT can be used for 
any other gene without distinct phenotype upon insertion behind an 
unrelated carrier gene with properties useful in selection. - In employing 
10 that technique an initial phase of (repeated) positive selection for infected 
cells expressing that proximal trait (e.g. by FACS or by magneto-beads) 
will be followed by a second phase with negative selection. I.e. against 
that fraction of infected cells still exposing the same property, 

15 A. 3: Isolation of an influenza strain designed to carry 2x2 flanking 
ribozyme target sequences at the 5' and 3' end of vRNA segment 4 coding 
for hemagglutinin: 

The above scheme for an indirect selection of any foreign recombinant 
gene behind a proximal carrier gene is further modified by deleting the 

20 carrier gene altogether. Instead, both 3' terminal promoter sequences 
(mutant and wildtype) follow each other at a short distance, separated 
only by a specific, repetitive ribozyme target sequence, - different from 
other target sequences to be described further below. The cDNA insert 
following after the second 3' promoter sequence consists mainly of a 

25 regular hemagglutinin (H7) coding sequence, however both the 5' and 3' 
vRNA terminal regions of the insert carry that other ribozyme target 
sequence (different from the first target sequence mentioned above) 
inserted in either location in tandem duplication (pHL2969, see Pig. 14). 
Due to a superior replication supported by the promoter-up variant located 

30 in 3' external position the recombinant HA segment attached to that 
promoter sequence will become enriched during the first steps of viral 
propagation, while the originally dominating HA segment of the helper 
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Virus which is under control of a wildtype promoter sequence is 
consecutively reduced and finally is no longer detectable among viral 
progeny. This result is documented by RT-PCR analysis of consecutive viral 
populations as obtained in that stepwise propagation procedure, see Fig. 
5 4. 

In the next step the viral lysate is twice passaged via infection of cell 
culture cells (293T) that before have been DNA-transfected by plasmid 
pAM403 (Fig. 15). This construct has been specifically designed to express 
a hammerhead ribozyme with flanking sequences complementary to the 

10 repetitive GUC-containing target sequence, as present twice in between 
the external and internal 3' promoter signals in the recombinant HA vRNA 
segment, see Fig. 5. In this way the extra promoter sequence is cut off 
from the finally resulting recombinant HA vRNA. Its promoter-up activity 
was useful in achieving an initial increase in the concentration of 

15 recombinant HA vRNA over wildtype HA vRNA, and in finally excluding the 
latter from further propagation. However, for the same reason that high 
activity of the promoter variant will cause instability in the resulting viral 
progeny, and an effective 'substitution' at this time through ribozyme 
cleavage by the internally located promoter signal, wild-type or slightly 

20 improved, will restore stability to the progeny viruses, with all of their 
eight vRNA segments now brought back in balance to each other. Due to 
the ribozyme cleavage site at 26 nucleotides 3' of the wild-type promoter 
sequence (see Fig. 5), in the initial stage that promoter signal is situated 
in a vRNA-internal location, extended by a 3' terminal sequence of 26 

25 nucleotides. According to the data presented in Fig. 1 this should cause a 
transient slight reduction in activity, resulting however in one step in 
regular viral mRNAs and cRNAs, with any initially remaining extra 
sequence being lost from the finally resulting recombinant HA vRNA. 
Progeny viruses still carrying an external promoter-up sequence (before 

30 ribozyme treatment or due to incomplete reaction) will not cause any 
plaque, due to over-replication of one vRNA segment relative to all others 
which results in a high load of defective particles. However, progeny 
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Viruses which have lost that external promoter element due to ribozyme 
cleavage will yield regular plaques due to a balanced mode of replication 
for ail eight wild-type or recombinant vRNA segments. Hence plaque 
purification is used for isolating a pure influenza viral strain carrying 2x2 
ribozyme targets in its recombinant HA vRNA segment, with its termini 
reduced to the wild-type promoter sequence. The nature of the viral strain 
isolated has been confirmed In this regard by RT-PCR analysis, see Fig. 4. 
The above isolation procedure resulting in influenza viral strains carrying 
2x2 flanking ribozyme target sequences has been performed twice for the 
HA coding segment (segment 4) to obtain two different isolates with 
regard to the orientation of the ribozyme target sequences. In one of the 
isolates (vHM41, see SEQ. ID NO: 3) the tandem target sites have been 
inserted into the HA vRNA non-translated sequence both in 5' and 
3' location, while according to the second design that 5' tandem target 
sequence has been included in an inverted orientation, such that it is now 
present in the cRNA 3' sequence instead (vHM42). 

In another experiment the same procedure was used to isolate an 
influenza strain carrying 2x2 tandem target sites within the 5' and 
3' flanking positions of segment 8 vRNA, i.e. coding for genes NSl and 
NS2 (VHM81; see SEQilD NO:4). And in principle the same could be done 
for any other influenza segment, in particular since only the reading frame 
cDIMA sequence has been exchanged from HA to NS, with all of the 
flanking elements directly responsible for that procedure remaining in 
place, unchanged. 

A.4: Ribozyme cleavage and vRNA segment exchange using ribozyme- 
sensitive influenza strains as helper viruses: 

In an initial model experiment a range of ribozyme type and target site 
locations was probed in designing a series of CAT reporter gene vRNA 
constructs (analysed In the presence of a surplus of wildtype helper virus) 
in 293T cells. While all of the ribozyme constructs adhered to the 
hammerhead model, with 10 to 12 nucleotides of complementary 
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sequence flanking on either side of the GUC target site, these ribozyme 
constructs varied from mononner to dimer to trimer repetitions. Ribozynne 
containing mRNAs were synthesized in vivo via the basic vector plasmid 
pSV2-/?eo, i.e. using the efficient psve RNA polynnerase II promoter 
5 element for expression, and the SV40 origin signal for plasmid 
amplification, in a cell line (293T or cos-l) with an incorporated SV40 
T antigen gene. In addition the pSV2-/7eo mRNA includes the small, 63 
nucleotide intron sequence of the S\/40 early mRNA which is supposed to 
be spliced very slowly, thereby extending the pre-mRNA half-live in the 

10 nucleus. Each of the pSV2-/?eo-ribozyme plasmid constructs was 
transfected into 293T cells. Thereafter, recombinant viruses containing 
dimer target sites either near one end of the molecule only, or near both 
ends have been used for infection of the transfected cells. Relative 
activities of ribozyme constructs versus vRNA target sites have been 

15 measured via inactivation of CAT acetyl transfer rates in the cell lysates 
obtained at 8 h post infection (Fig. 6). The highest activities were 
observed for dimer ribozymes acting on vRNA molecules carrying 2x2 
target sequences on both ends of the molecule, either in vRNA 5' and 3' 
location, or in vRNA 3' and cRNA 3' location, i.e. with an Inversion of the 

20 target site sequence at the vRNA 5' end. 

Consequently, the two constructs carrying tandem ribozyme double 
targets within both of their non-translated vRNA flanking sequences have 
been used in the design of ribozyme-sensitive influenza virus strains as 
described above, with both variants isolated for segment 4 (HA), and one 

25 of them for segment 8 (NS). In complementary correspondence the 
hammerhead ribozyme plasmid used has also been constructed as a 
double-headed structure with flanking sequences as shown in Fig. 7 
(pAM424; for its detailed structure, see Fig. 16). 

30 The three target site-containing influenza strains isolated as described 
above have been analysed for their sensitivity against ribozyme cleavage 
by infection of 293T cells, which had been DNA-transfected 18 h eariier by 
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ribozyme-producing plasmid pAM424, at DNA-transfection rates between 
60 and 70 % (as observed in parallel transfections using GFP-expressing 
plasmid pAM505). Inactlvation rates in these one-step control experiments 
were between 90% and 99% for all three ribozyme-sensitive strains, in 
5 their extent mainly depending upon the actual DNA-transfection rates 
achieved in individual experiments, 

In the next step both isolates of HA-coding ribozyme-sensitive viruses, 
VHM41 and vHM42, have been used in marker-rescue experiments. Here, 
293T cells have been first DNA transfected by HA-variant cDNA construct 

10 PHL2507 (see Fig. 17), followed after 18 h by vHM41 or vHM42 virus 
infections at moi 1. The resulting viral supernatant containing e.g. a 
mixture of ribozyme-sensitive vHM41 carrier virus and pHL2507/vHM41 
recombinant virus is propagated In an intermediate step on MDCK cells, 
which also results in an increase in the fraction of recombinant viruses. 

15 Thereafter the resulting virus-containing supernatant is passaged through 
293T (or cos-1) ceils, which in advance have been transiently transfected 
by ribozyme-producing pAM424. As may be concluded from the above 
experiment (Fig. 6) and shown in Fig. 8 vHM41- or vHM42-derived 
ribozyme-sensitive HA vRNA segments are expected to be inactivated by 

20 pAM424-produced ribozyme down to a remaining level of about 1% to 
10% (mainly present within cells that are infected, but not DNA- 
transfected). 

Instead, the substitute HA vRNA which originated from pHL2507 plasmid 
25 DNA (ribozyme-resistant) becomes exclusively incorporated into progeny 
virions. For further purification and viral propagation these have been 
passaged a second time through 293T cells, again in advance DNA- 
transfected by pAM424, and after another amplification step on MDCK 
cells the resulting viral preparations have been used for RT-PCR analysis. 
30 The resulting viral populations in these marker rescue experiments only 
contain HA-vRNA molecules derived from pHL2507, which in their PCR 
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analyses are of intermediate size relative to wildtype HA-vRNA, and 
VHM41- or vHM42-derived HA-vRNAs, respectively. 

Consequently, a set of ribozyme-sensitive influenza strains witii targets 
inserted Individually into every vRNA molecule may be used for such one- 
5 step marker rescue experiments in general, i.e. for vRNA segment 
exchange reactions performed in a directional way for any of the eight 
influenza vRNA segments, without requirement for a selectable change in 
phenotype (genetic marker). 

^ Expression of two gene pro ducts from ambisense bicistrnnir 

influenza vRNA 



B.l: The influenza cRNA promoter is active in antisense viral mRNA 
transcription according to the cap-snatching mode of initiation: 

15 While the vRNA template of influenza virus is known to be active in viral 
mRNA 5s well as cRNA synthesis, the cRNA template has been described 
so far only to produce vRNA molecules, as a second step in viral 
replication. The potential activity of the cRNA promoter in initiating also 
viral mRNA transcription has not been analysed or even suspected so far, 

20 since no antisense (vRNA) reading frame can be detected In any of the 
viral RNA segments. Also, no Us/Ue template sequence element is present 
in any of the viral cRNAs in an adjacent position to its 5' promoter 
structure as is the case for all viral vRNAs. This element is known to serve 
as a template sequence for mRNA terminal poly-adenylation, in repetitive 

25 Interaction. However, when both elements are provided through 
reconstruction of an artificial influenza cRNA segment: a reading frame in 
opposite orientation (CAT), and a Ue template element in 5' adjacent 
location, CAT expression can indeed be observed, see pHL2583 (see Fig. 
18) in Fig. 9. Similar to the vRNA promoter the cRNA promoter activity is 

iO Improved by (the same) promoter-up mutations, which essentially consist 
of basepair exchanges according to the 'corkscrew' model. This model 
apparently also holds for the cRNA promoter structure as analysed in a 



wo 00/53786 PCT/EPOO/01903 

25 

Stepwise manner in Fig. 9. While the cRNA promoter has to be superior 
over the vRNA promoter in its initiation of replication, since the 
vRNA/cRNA product ratio was determined to be around 10:1 (Yamanaka 
et al.. In vivo analysis of the promoter structure of the influenza virus RNA 
genome using a transfection system with an engineered RNA. Proc. Natl. 
Acad. Sci. USA 88, 5369-5373 (1991)), the cRNA promoter is observed to 
be inferior to the vRNA promoter in its initiation rate of transcription 
(compare pHL2583 with pHL1844 in Fig. 9), at least for all promoter 
variants tested so far. 

A RACE analysis for determination of the 5' ends in pHL2583 cRNA 
promoter transcribed mRNAs proved this initiation to occur according to 
the cap-snatching mode, in complete equivalence to standard vRNA 
promoter controlled transcription initiation. 

a2l Development of ambisense influenza constructs for consecutive 
expression of two genes (GFP and CAT) from a single viral RNA: 
For a bidirectional transcription and translation of influenza RNA segments 
the two reporter genes GFP and CAT have been arranged in opposite 
orientation to each other, and the flanking 5" and 3" promoter sequences 
(adhering to promoter-up variant 1104) had to be reconstructed to include 
a Ue poly-adenylation element in either orientation in a 5' promoter 
adjacent position. This requirement necessarily resulted in a promoter- 
adjacent 5'-U6/3'-A6 complementary structure, both in the vRNA and cRNA 
terminal sequence (see Fig. 10), which had to be tested for its promoter 
activity, in either orientation. Therefore, the convergent pair of reporter 
genes GFP and CAT has been inserted in both orientations, such that 
CAT transcription is initiated by the vRNA promoter in one construct 
(PHL2960), and by the cRNA promoter in the other (pHL2989, Fig. 19), 
and vice versa for GFP expression from both ambisense constructs. In 
addition, also the CAT gene only has been inserted In either orientation 
between the 5' and 3' elements of that ambisense promoter, with CAT 
transcribed by the vRNA promoter in one case (pHL2959), and by the 
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cRNA promoter in the other (pHL2957). The whole set of constructs allows 
for a direct comparison with corresponding reference constructs carrying a 
regular vRNA promoter (pHL1844) or cRNA promoter structure (pHL2583), 
i.e. carrying only the 5'-adjacent Ue sequence element and no S'-Ae 
counterpart. The two groups of constructs also differ In insert size, since a 
single inserted gene roughly accounts for 750 nucleotides, the convergent 
set of two genes for 1500 nucleotides, with the distal half of both mRNAs 
in this case remaining untranslated, a situation unusual for influenza viral 
mRNAs. 

As is demonstrated in Fig. lOB for CAT expression of the various 
ambisense constructs all of them are able to initiate transcription in both 
orientations, even if at different levels with regard to their vRNA and cRNA 
promoter-dependent expressions, and also with regard to the insert 
lenghts and convergent arrangements of the GFP/CAT versus CAT-only 
constructs. Analysis of the GFP expression rates (not shown) yields 
complementary results, i.e. again vRNA promoter-controlled GFP 
expression Is superior over cRNA promoter expression of GFP. Therefore, 
individual ambisense clones either show an asymmetric high expression of 
GFP and low expression of CAT (pHL2960) or vice versa (pHL2989), 
depending on their orientation of reading frames with regard to the 
external vRNA and cRNA promoter. Fig. IOC also demonstrates successful 
propagation of recombinant viruses containing ambisense RNA molecules, 
which proves survival through amplification, packaging into virions, and 
expression of both mRNAs in infected MDCK cells (including besides CAT 
also GFP expression). 

B-3: Construction of a superior promoter-up mutation, pHL1920, to be 
used for Improved rates of cRNA promoter expression in ambisense 
constructs: 

An extended analysis of promoter variants, in particular of complementary 
double exchanges according to the 'corkscrew' model yielded among 
others variant pHL1920 (Fig. 20) with CAT activity rates considerably 



WO 00/53786 PCT/EPOO/01903 

27 

above (125-130% of) the rates observed for standard promoter-up 
variant '1104' (as present in pHL1844), The *1920' promoter-up variant 
consists of altogether 5 nucleotide substitutions relative to the wildtype 
promoter sequence, both In the 5' promoter element (2), and the 
5 3' promoter element (3). The structure of this variant and the whole set of 
complementary double exchanges is presented in Fig, 11, together with 
the respective CAT activity measurements, in vRNA promoter constructs. 
vRNA promoter-up variants also show similarly Improved expression in 
(ambisense) cRNA constructs, even if at generally lower levels than in 

10 vRNA constructs- cRNA promoter-up expression is observed at levels 
similar or somewhat (2x-5x) above the wild-type vRNA promoter rate, 
while vRNA promoter-up constructs show CAT expression rates increased 
up to 20 or 25 times the wild-type vRNA promoter level. In either case 
expression rates also depend on the size of the insert, with promoter 

15 activity rates decreasing with increasing lengths of the influenza RNA 
molecules to be transcribed. 

B.4: Influenza recombinant viruses containing a foreign gene (CAT) in 
covalent ambisense linkage with one of the viral genes (HA, NS1/NS2): 

20 The principle solution in designing stable recombinant viruses based on 
the new properties observed for influenza transcription and replication 
signals consists in constructing viruses which contain a foreign gene in 
covalent linkage with one of the (indispensable) viral genes, in ambisense 
biclstronic organization. Preferably the viral gene is connected to the cRNA 

25 promoter, while vRNA promoter expression is used for expression of the 
foreign gene at rates considerably above the viral mRNA synthesis. The 
promoter-up variant chosen for constructing the ambisense RNA segment 
intends to bring its cRNA promoter expression (approximately) into 
balance with all other viral gene expression levels, which are controlled by 

30 wild-type vRNA promoters located at the termini of the seven ordinary 
influenza segments; the respective choice has to take into consideration 
the overall length of the ambisense segment. 
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Isolation of the amblsense recombinant virus employs an RNA 
polymerase I-transcribed amblsense cDNA construct, which will give rise 
in vivo to ambisense cRNA-type molecules, see Fig. 12. The plasmid DNA 
transfection mixture used in this step with 293T cells in addition may or 
may not contain four 'booster' plasmlds which under pcMv-control produce 
the four early influenza proteins from non-viral mRNAs: NP, plus PBl, PB2, 
and pa; i.e. the three subunits of viral polymerase (Plesehka et al., A 
plasmid-based reverse genetics system for influenza A virus J. Virol. 70, 
4188-4192 (1996)), which will increase in a pre-amplification step the 
copy number of that ambisense viral cRNA segment. At 18 h post 
transfection the 293T cells are infected by a ribozyme^sensitive influenza 
strain, e.g. vHM41, which will supply (again) early and also late viral 
RNAs. The resulting supernatant which contains a mixture of vHM41 
carrier virus and vHM41-derived ambisense recombinant virus (nine vRNA 
segments) is then passaged directly or via an intermediate step of 
amplification on MDCK cells onto 293T cells that have in advance been 
DNA-transfected by ribozyme-producing pAM424. Here, the ribozyme- 
sensitive vRNA segment of vHM41 will be cleaved at its 2x2 target sites by 
pAM424 specific ribozymes. In recombinant viruses the vRNA gene lost in 
this way is re-supplemented through its presence within the ambisense 
segment. The virus-containing supernatant is passaged for amplification 
and further purification through ribozyme treatment a second time on 
293T cells which again have been pretreated by pAM424 DNA transfection. 
Absence of ribozyme-sensitive vRNA, and presence only of ambisense RNA 
in RT-PCR analysis at this stage allows for further amplification on MDCK 
cells and a final virus stock preparation on embryonated chicken eggs. 
CAT assays can be used to analyse for the presence and monitor the 
activity of this model foreign gene through the various steps of isolation 
and propagation as well as document technical improvements that might 
be worked out for one or more of the processive stages. 
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Examples for application of h elper-free, stable recombinant influenza 

viruses 

CJj. Incorporation of reporter gene GFP in NS/GFP or HA/GFP ambisense 
segments: 

Recombinant viruses of this type will allow to follow-up on influenza 
infection instantly and continuously in individual infected cells, which may 
also be counted or documented by fluorescence photography or FACS 
sorting. With improved temperature resistance (Siemering et al.. 
Mutations that suppress the thermosensitivity of green fluorescent protein. 
Curr. Biol. 6, 1653-1663 (1993)) GFP expression becomes visible at 2 h 
after infection and shows bright fluorescence after 4 h p.i., it will be 
possible to follow-up on the spread of viral infection by GFP fluorescence. 
Stable fluorescence in abortively infected cells e. g. observed In ex vivo 
treatment of dendritic cells similarly supports a follow-up on their 
reincorporation into animals; other genes may be incorporated by 
ambisense vRNA into dendritic cells in the same way. 

C2i Construction of glycoprotein CSFV-E2 carrying influenza, helper- 
free: 

The glycoprotein E2 of CSF virus has been incorporated previously into 
influenza both as an HA-anchor fusion protein within the viral envelope, 
and as an additional, unstable ninth vRNA segment into its genome (Zhou 
et al.; Membrane-anchored incorporation of a foreign protein in 
recombinant influenza virions. Virology, 246, 83-94 (1998)). Stabilization 
Is now achieved through an ambisense connection with either of the 
regular viral RNA segments (NS or HA) which also allowed to reach a level 
of 100% recombinant viruses instead of an hitherto only 20% (Fig. 13), 
since all carrier viruses are destroyed through ribozyme action, see Figs. 8 
and 12. The helper virus containing preparation has already been used 
successsfully as a vaccine against CSFV infection (antibody titers of 
1:40000); the Increase achieved in recombinant viruses allows a further 
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improvement in that regard. Also (cost-effective) propagation in fertilized 
chicken eggs has become possible due to its stable incorporation of the 
foreign gene as a covalent ambisense construct. 

5 C.3: Construction of hepatitis C glycoprotein-carrying recombinant 
influenza viruses as a candidate vaccine: 

Hepatitis C virus is a close relative of CSF virus (hog cholera virus), and in 
particular its set of two glycoproteins, small-size El and larger-size E2, is 
closely related in structural detail and presumably also in function to the 

10 corresponding CSFV proteins. An incorporation of HCV-E2/HA fusion 
proteins into influenza viral envelopes has been achieved in analogy to the 
CSFV-E2/HA Incorporation. In addition, incorporation of an anchor-fusion 
glycoprotein HCV-El/HA or both together (in NS and HA ambisense 
junctions) allows further variations in constructing an influenza-based 

15 vaccine for hepatitis C. In analogy to CSFV-E2, neutralizing antibodies are 
expected to be directed against particular epitopes of HCV-E2, presented 
in essentially native conformation at the influenza viral envelope. 

C.4: Stable incorporation of selected Influenza T-cell epitopes in 

20 ambisense constructions: 

Influenza Infection is known to result in both, antibody production against 
that specific viral strain or indeed its epitopes that are located mainly at 
the surface of HA, and in an increase of specifically primed cytotoxic T- 
lymphocytes, stimulated by T-cell epitopes primarily located within the 

25 essentially invariable core structure of the MP protein. While the humoral 
response will result in life-long immunity against that particular strain of 
influenza or its epitope structures, the T-cell response will be lost or 
severely reduced some time afterwards, such that its specificity against 
influenza in general will fall below protective levels. One way in trying to 

30 increase that level of cellular immunity is to enhance the response or 
recruitment of influenza-specific CTL cells by increasing the level of T-cell 
epitopes in the infected cell and hence its presentation on the surface by 
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MHC-I receptors. This is achieved by combining in an ambisense construct 
the HA gene and a series of repeated T-cell epitope sequences as present 
in the influenza NP gene, in a model design specific for the BALB/C mouse 
MHC-I allele. Here, the promoter-up expression rate is realized (in vRNA 
promoter-controlled Initiation) for expression of the repetitive epitope 
polypeptide chain. Alternatively or in addition a controlled secretion of an 
interleukin can be achieved from recombinant Influenza-infected cells, 
upon ambisense incorporation of the respective gene preferrably into the 
NS segment. The interleukin to be chosen for this purpose (IL-12 or other) 
is selected to enhance the longevity of influenza-specific CTL cells or its 
conversion Into corresponding memory cells. In this way an ambisense 
vaccine strain against influenza itself is achieved with expected protective 
capacity against influenza in generat- 
es Exchange of influenza glycoproteins against foreign viral 
glycoproteins (VSV-G): 

The 'marker rescue' experiment described above (section A.4.), i.e. an 
exchange of one HA gene (ribozyme-sensitive) for another (ribozyme- 
resistant) can also be used for an exchange of HA for an entirely different 
glycoprotein, such as the vesicular stomatitis virus G protein, as long as it 
is attached to the HA anchor segment. Effective incorporation depends 
always on that C-terminal tail sequence and its interaction with underiying 
matrix protein Ml, and therefore, all constructions consist of fusion 
proteins in direct analogy to CSFV-E2- VSV-G with or without a foreign 
anchor sequence has been shown in several other viruses to be able to 
substitute for the original glycoprotein and to result in infectious viruses 
with VSV-G specific host-ranges (e.g. in retroviruses, rabies virus, measles 
virus). G protein in VSV itself as well as on the surface of foreign viruses is 
the only glycoprotein required for all of the consecutive steps in infection , 
Insertion of VSV-G instead of HA in recombinant influenza viruses leaves 
the second glycoprotein, neuraminidase, without any function, which then 
will get lost spontaneously from the recombinant viruses. This will further 
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increase the capacity for an addition of foreign genes, beyond the gain 
resulting fronn an exchange of the larger HA for the smaller VSV-G, which 
might be used for an addition of ambisense constructs. 

5 The invention is further illustrated in the accompanying figures. 

Detailed Description of tlie Figures 

Fiq.l: 3' nucleotide extensions of influenza vRNA template molecules: 

10 (A) Murine B82 cells have been transfected by plasmid cDNA constructs 
designed to be transcribed into influenza vRNA molecules by RNA 
polymerase I in vivo, followed after 20 h by standard FPVBratisiava helper 
virus infection, at an moi of 1 to 3. In addition to reference plasmid 
pHL2024 (no extension), related cDNA constructs carrying extensions of 1 

15 to 50 bp, and hence extended by 1 to 50 nucleotides at the resulting vRNA 
3' ends were used in parallel transfections; template extensions are 
marked at the top of the figure. Cell iysates prepared at 8 h post helper 
virus infections were used for CAT reactions using in one round 50 pi of 
cell lysate each, and in further analyses 5pl and 0.5 pi of lysate (not 

20 shown). Relative yields were determined in comparison to reference 
plasmid pHL2024, as indicated below the figure, with calculations 
restricted to those CAT assays showing less than 40% of substrate 
consumption, in three or more independent experiments. 

(B) Viral passage of B82 supernatants containing recombinant influenza 
25 virus onto MDCK cells, at an moi of 2 to 4, in average. Again at 8 h post 
infection cell Iysates have been prepared and used for CAT assays. 
Relative yields as indicated below the figure have been determined In 
comparison to pHL2024 the same way as in (A), using 0.5 pi of cell lysate 
in each case. (The 50 pi CAT assays as shown here and also in the 
30 following figures intend to give an immediate impression of relative 
activities at always the same level, while the actual measurement data as 
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indicated below the lanes are obtained at various appropriate enzyme 
concentrations relative to reference pHL2024.) 

Fig. 2: Propagation oF recombinant influenza viruses with tandem 
5 bicistronic vRNA: 

(A) General design of expression plasmids for transient bicistronic vRNAs 
coding for GFP in the mRNA-proximal, and for CAT in the mRNA-distal 
position. Among the functional elements indicated are the human RNA 
polymerase I promoter (pih) and murine rDNA terminator (ti) sequences, 
10 both hatched, and the 5' and 3' vRNA promoter cDNA sequences, open 
and closed boxes, respectively. For the vRNA-internai S* promoter signal 
three variant sequences have been inserted as indicated below (positions 
1 to 15 refer to 3'-terminal nucleotides in the resulting monocistronic 
vRNAs). 

15 (B) CAT assays as determined relative to pHL1844 (monocistronic CAT 
construct) after DNA transfection of 293T ceils plus helper virus infection 
followed by one round of progeny viral propagation on MDCK cells are 
indicated below the lanes. Relative activities of the internal promoter 
sequences as indicated above the figure refer to measurements in a 

20 monocistronic externa/ location of the same promoter variants (Flick et a/.. 
Promoter elements in the Influenza vRNA terminal structure, RNA 2, 1046- 
1057 (1996)). Control clone pHL2300 contains an unrelated, non- 
functional sequence in the central location in an otherwise identical 
plasmid construct. 

25 

Fig, 3: Tandem bicistronic vRNA sup-porting an alternative mode of tran- 
scription and replication initiation: 

An additional internal 3' promoter sequence has been inserted in between 
both cistrons, in a vRNA-central position. Left half: bicistronic replication 
30 and transcription leading to (proximal) GFP expression. Right half: internal 
initiation resulting in monocistronic replication and transcription leading to 
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(distal) CAT expression, and causing deletion of the GFP sequence from 
progeny molecules. 



Fig. 4: Outgrowth of promo'ter-u p recombinant vRNA versus wiidtvpe 
5 vRNA segments in stepwise propagation of influenza virus: 

RNA polymerase I transcription of transfected pHL2969 DNA results in 
influenza vRNA carrying (an external) promoter-up mutant '1104', and 
containing 2x2 ribozyme targets in flanking positions relative to its HA 
coding sequence. Another HA vRNA segment in wild-type configuration 

10 and originating from infecting FPV helper virus is also present in the 
recombinant virus preparation, initially (lane 1; 293T lysate) in surplus 
amounts, but reduced and finally lost entirely in consecutive steps of 
propagation (lanes 2 to 4; MDCK cell lysates), and in isolated strains after 
pAM403 ribozyme treatment for removal of the external '1104' promoter 

15 sequence (lane 5). Determination throughout by RT-PCR analyses using a 
pair of primers extending across the 5' inserted target site sequence, with 
435 bp representing the recombinant HA segment, and 306 bp the wild- 
type sequence without an inserted target site sequence. 

20 Fig. 5: pAM403 riboz vme cleavage of PHL2969 derived vRNA molecules at 
specific target sites in serted between an external and an internal 
3' promoter sequence: 

The external promoter-up (•1104") signal is used for yRNA amplification 
within recombinant viruses and reduction of helper virus HA vRNA (Fig. 4), 

25 while the 'switch' to an internal wild-type signal guarantees stable 
replication of recombinant viruses. pAM403 hammerhead ribozyme RNAs 
are indicated in complementary binding to their target site sequences (12 
and 10 nucleotides flanking the GU'C cleavage point) by straight lines 
flanking a central secondary structure symbol. vRNA-internal 2x2 

30 ribozyme targets are marked by xx (see Fig. 7). 
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Fig, 6: Comparative cleavage analysis of model CAT vRNAs with tandem 
target sites in various flanking positions, by target-specific ribozvmes: 
293T tissue culture cells have been transiently DNA-transfected either by 
a single-headed hammerhead' ribozyme (s), or a double-headed (d), or 
5 triple-headed (t) ribozyme cDNA construct, all specifically designed to 
hybridize to a tandem dimer target site sequence inserted in flanking 
positions into the CAT vRNA. All ribozyme RNAs have been expressed from 
the same pSV2-/?eo plasmid vector, including a pSVl-neo control 
construct without an inserted ribozyme cDNA sequence (c). At 20 h after 

10 DNA transfection (which reached 65% yield as measured by Pcmv-GFP 
transfection in parallel of the same cell culture) the 293T cells were 

J infected by CAT recombinant viruses carrying tandem double target 
sequences either only in vRNA-3' position, or in both vRNA-3* and 5' 
positions, or in both vRNA-3' and cRNA-3' positions. Most effective among 

15 the s, d, or t-ribozymes were double-headed constructs, acting on 2x2 
targets inserted In either of the two localizations described (lanes 6 and 
10). 



Fig. 7: Alignment of pAM424 double-headed ribozyme with one of their 
20 repetitive target sequences located within the 5' and 3' vRNA flanking 
regions: 

The superior activity of ribozymes oriented against targets located In the 
3' end of vRNA molecules over those present in the 5' end instead (not 
shown) is in agreement with the model for Influenza vRNA transcription 

25 and replication (Lamb and Krug, Orthomyxoviridae: The viruses and their 
replication. In 'Virology' (B.N. Fields, D.M.Knlpe, P.M.Howley, R.M.Chanock, 
J.L.Melnick, T.P.Monath, B.Roizman, and S.E.Straus, Eds.), 3"^"* ed.. Vol. 1, 
pp. 1353-1395. Lippincott-Raven, Philadelphia (1996)), according to which 
influenza polymerase stays attached to the 5' end of the vRNA molecule 

30 throughout the entire or even several rounds of transcription, whereas the 
very 3' end repeatedly, in every initiation reaction serves as the template 
sequence, and consequently is no longer covered by polymerase. 
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Superiority of a double-headed over a single-headed ribozyme has been 
deternnined earlier in this laboratory (A.Menke, Anti-Influenza Ribozyme: 
vRNA-Spaltung und intrazellulare Aktivitat. Dissertation Universitat 
Giessen (1997)), but the substantial increase of vRNA inactivation rates 
5 upon incorporation of tandem target sites at both ends of the vRNA 
molecule instead of only one has been observed here for the first time, 
v^ithin that overall design. 

Fig. 8: pAM424 ribozyme cleavage of resistant FPV wild-type HA vRNA and 
10 ribozyme-sensitive pHL2969-derived HA-vRNA in 293T cells infected by 
VHM41 after isolation from pHL2969-recombinant viral preparations. 
Lane 1: FPV infection of 293T cells, untreated; lane 2: FPV infection of 
293T cells DNA-transfected by pAM424; lane 3: vHM41 infection of 293T 
cells, untreated; lane 4: vHM41 infection of 293T cells DNA-transfected by 
15 pAM424. RT-PCR analyses of purified viral progeny as in Fig. 4. 

Fig, 9: Functional analysis of the influenza cRNA promoter structure: 

(A) Schematic cRNA promoter ('1104') secondary structure according to 
the 'corkscrew* model; nucleotides involved in single or double nucleotide 

20 exchange are marked by their position. 

(B) CAT analyses of 293T cell lysates after DMA transfection and FPV 
helper virus infection of cRNA promoter variants, in comparison to 
standard vRNA promoter-up mutant '1104' (pHL1844). Nucleotide 
substitutions divergent from the basic '1104* structure as present in 

25 pHL2583 or pHL2721 (see above) are indicated above the lanes, positions 
3 or 8 as marked by a bar refer to cRNA positions counted from the 
3' end. Relative CAT activities are marked below the lanes. 

Fig. 10: Functional analysis of the vRNA and cRNA promoter in ambisense 
30 arrangement: 
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(A) Sequence organisation of the ambisense promoter cDNA construct 
carrying Te/Ae elements adjacent to the terminal sequence, and secondary 
structure predictions for the resulting cRNA and vRNA promoter signal. 

(B) CAT expression data obtained from the cell lysates of 293T cell after 
plasmid DNA transfection and FPV infection, and (C) from cell lysates of 
MDCK cells after one step of viral passage. Indicated above the lanes are 
promoter/gene conjunctions: v = vRNA promoter; c = cRNA promoter. 

Fig. 11: Baseoair s ubstitutions according to the vRNA 'corkscrew' strurhirt^- 

(A) 'Corkscrew' conformation of the vRNA promoter drawn against a 
schematic indication of interacting tripartite viral polymerase. Paired 
positions exchanged in individual experiments are indicated by numbers, 
nucleotides 3 or 8 are counted from the 3' end. pHL2024 containing 
promoter-up mutation '1104' is used as the reference construct (=100%) 
in all of the CAT assays, while pHL2428 represents the wild-type promoter 
structure. 

(B) CAT expression data obtained after one step of viral passage in MDCK 
undiluted, and 50 fold diluted. 



Fig^ — 12i Flow-chart of the Isolation procedure for an ambisense 
recombinant Influenza virus. 

Fiq. 13: Immuno -electron microscopy of purified influenza FPV/CSFV-E2- 
HA virions: 

Recombinant viruses exposing the foreign glycoprotein CSFV-E2 in their 
envelopes, which has been fused onto the HA anchor domain, are marked 
by anti-E2 monospecific antibody and by secondary gold-labelled (5nm) 
goat antibody. Recombinant viruses (16%) are present together with 
their FPV helper viruses. 



Fig- 14: PHL2969; the exact sequence of the 4930 bps circular DNA is 



shown in SEQ. ID NO:l. 
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Fig. 15: pAM403; the exact sequence of the 5811 bps circular DNA is 
shown in SEQ. ID NO:2. 

5 Fig. 16: pAM424; the exact sequence of the 5860 bps circular DNA is 
shown in SEQ. ID NO:5. 

Fig. 17: pHL2507; the exact sequence of the 4610 bps circular DNA is 
shown in SEQ. ID NO:6. 

10 

Fig. 18: pHL2583; the exact sequence of the 3558 bps circular DNA is 
shown in SEQ. ID NO:7. 

Fig. 19: pHL2989; the exact sequence of the 4343 bps circular DNA is 
15 shown in SEQ. ID NO:8. 

Fig. 20: pHL1920; the exact sequence of the 3888 bps circular DNA is 
shown in SEQ. ID NO:9. 

20 Hence, the present invention is based on two surprising findings, namely 

1. influenza virus promoters are active when present internally in a gene; 

2. the so-called cRNA, thought to be an Intermediate in replication can be 
turned into a protein-encoding RNA by equipping it with a variant influenza 
Virus promoter, described in the present invention. 

25 

These two observations were used to make ambisense constructs. This 
allows to package an additional, foreign gene into influenza virus particles. 
Such particles were made previously, by other methods, but proved to be 
unstable, and therefore useless. For use as a vaccine for example, a 
30 helper virus would have been needed as a stabilizer. Stabilization in the 
present invention is achieved by several means. These include the 
"balancing" of one of the two promoters in the ambisense biscistronic 
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genetic construct with seven other vRNA wildtype promoters, while the 
additional promoter is used for high-rate expression of the foreign gene at 
various levels. 

5 Thus, the present invention provides a system for expression of foreign 
proteins in higher eukaryotic systems. One system in particular is 
interesting, namely embryonated chicken eggs, as it allows cost-effective 
production in an automatable way (as used by most flu vaccine 
producers). The reason that this process is now possible, is that the 
10 foreign protein Is part of a stable, engineered influenza virus particle. The 
virus can be designed also to rapidly monitor process improvements. 

il 

An excellent use is of course the use of the construct as a vaccine. The 
influenza virus particle is immunogenic and can now be equipped with 

15 foreign antigens, enabling for example the design and production of 
hepatitis C virus and HIV vaccines, but also of tumor vaccines. As the 
present invention shows, the foreign antigenic surface glycoprotein is 
"fused" to a C-terminal segment of influenza HA, and the antigen then is 
presented at the surface of influenza virus particles. In addition, these 

20 vaccines can now be made in the way standard flu vaccines are made, i.e., 
in embryonated chicken eggs. 
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Claims 

1. A recombinant influenza virus for high-yield expression of incorporated 
foreign gene(s), which is genetically stable in the absence of any helper 
virus and which has eight viral RNA segments, wherein at least one of the 
regular viral RNA segments is replaced by an RNA molecule (ambisense 
RNA segment), said ambisense RNA segment containing one of the 
standard viral genes in sense orientation and a foreign, recombinant gene 
in anti-sense orientation, or wee versa, covalently linked to each other and 
in overall convergent arrangement. 

2. The recombinant virus according to claim 1, wherein in the ambisense 
RNA molecule said foreign recombinant gene is covalently bound to one of 
the viral genes, while the original vRNA segment coding for the same gene 
is deleted from the recombinant virus by way of specific ribozyme 
cleavage. 

3. The recombinant influenza virus according to claims 1 and 2, wherein 
one or more of the regular viral RNA segments, differing from said at least 

|i? one ambisense RNA segment, comprises a vRNA encoding a foreign gene, 
preferably one or more of the regular viral RNA segments has (have) been 
exchanged for a vRNA encoding a foreign gene. 

4. The recombinant influenza virus according to claim 3 in which one or 
both of the standard glycoproteins hemagglutinin and neuraminidase have 
been exchanged into foreign giycoprotein(s) or into fusion glycoproteins 
consisting of an anchor segment derived from hemagglutinin and an 
ectodomain obtained from the foreign source, viral or cellular, or in which 
such recombinant glycoprotein has been inserted as a third molecular 
species in addition to the remaining standard components. 
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5. The recombinant influenza virus according to claims 1 to 4, in which 
the terminal viral RNA sequences of one or more of the regular segments 
and/or of the at least one ambisense RNA segment, which are active as 
the promoter signal, have been modified by nucleotide substitutions in up 
to five positions, resulting in improved transcription rates of both the 
vRIMA promoter as well as the cRNA promoter as present In the 
complementary sequence. 

6. The recombinant influenza virus of claim 5, wherein the 12 nucleotide 
conserved influenza 3' terminal sequence has been modified by 
replacement of one to three nucleotides occurring in said sequence at 
positions 3, 5 and 8 relative to the 3' end by other nucleotides, and/or 
wherein the 13 nucleotide conserved influenza 5' terminal sequence has 
been modified by replacement of one or two nucleotides occurring In said 
sequence at positions 3 and 8 by other nucleotides. 

7. The recombinant influenza virus of claim 6, wherein the replacements 
in the 3' terminal nucleotide sequence comprises the modifications G3A 
and C8U. 

8. The recombinant influenza virus of claim 7, wherein the replacements 
in the 3' terminal nucleotide sequence comprises the modifications G3A, 
U5C and CBU, or G3C, U5C and C8G. 

9. The recombinant influenza virus of claim 8, which comprises a 3" 
terminal nucleotide sequence of 5*-CCUGUUUCUACU-3'. 

10. The recombinant Influenza virus of claims 6 to 9, wherein the 5' 
terminal nucleotide sequence comprises the modifications U3A and A8U 
resulting In a 5'-terminal sequence of 5'-AGAAGAAUCAAGG. 
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11. The recombinant influenza virus according to claims 1 to 10, which is a 
recombinant influenza A virus. 

12. The recombinant influenza virus according to claims 1 to 11, in which 
the foreign gene(s) in ambisense covalent junction with viral gene(s) code 
for proteins and/or glycoproteins which are secreted from cells infected 
with the recombinant virus. 

13-The recombinant virus according to claims 1 to 11, in which the foreign 
gene(s) in ambisense covalent junction with viral gene(s) code for proteins 
or artificial polypeptides designed to support an efficient presentation of 
inherent epitopes at the surface of infected cells, for stimulation of a B cell 
and/or T cell response. 

14. A method for the production of recombinant influenza viruses as 
defined in claims 1 to 13 comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, in 
ambisense design, 

(b) followed by infection with an influenza carrier strain constructed to 
include flanking ribozyme target sequences in at least one of its viral RNA 
segments which is (are) to be replaced by the ambisense segments of 
step (a), and 

(c) thereafter selective vRNA inactivation through ribozyme cleavage. 

15. A pharmaceutical composition comprising a recombinant influenza 
virus according to claims 1 to 13. 

16. Use of a recombinant influenza virus according to claims 1 to 13 for 
preparing a medicament for vaccination purposes. 

17. The use according to claim 16, wherein the medicament 
(a) is suitable against influenza and/or against other infections; 
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(b) is present in form of inactivated preparations; and/or 

(c) is present in form of live recombinant viruses. 

18, Use of a recombinant influenza virus according to claims 1 to 13 for 
preparing agents for somatic gene therapy. 

19. Use of a recombinant influenza virus according to claims 1 to 13 for 
preparing agents, for transfer and expression of foreign genes into cells 
infected by such viruses. 

20- Use of a recombinant influenza virus according to claims 1 to 13 for 
preparing agents for transfer and expression of RNA molecules into cells 
infected by such viruses. 

21. The use of claim 20, wherein the RNA molecules to be expressed are 
antisense sequences or double-strand sequences relative to the target cell 
cellular mRNA molecules, and/or the agent is suitable for sequence- 
specific gene silencing, preferably by antisense RNA or RNA interference 
mechanisms. 

22. The use according to claims 18 to 21, wherein the agents are 
applicable in ex vivo and in vivo application schemes. 

23. A method for the production of proteins or glycoproteins which 
comprises utilizing a recombinant influenza virus according to claims 1 to 
13 as expression vector. 

24. The method of claim 23, wherein the production is performed in cell 
culture cells or in fertilized chicken eggs. 

25. A method for preventing and/or treating influenza which comprises 
administering an effective amount of a recombinant influenza virus 
according to claims 1 to 13 to the mammal to be treated. 



AMENDED SHEET 



26. A method for somatic gene therapy, which method comprises 
subjecting the organism to be treated with a recombinant influenza virus 
according to claims 1 to 13. 

27. A method for transfer and expression of foreign genes into cells, and 
for transfer and expression of RNA molecules into cells, which method 
comprises infecting the cells with a recombinant influenza virus according 
to claims 1 to 13. 

28. Use of a recombinant influenza virus according to claims 1 to 13 for 
preparing agents for autologous immunotherapy. 

29. A method for an immunotherapy which comprises ex vivo infection of 
immune cells with a recombinant Influenza virus according to claims 1 to 
13, and introduction of the transduced cells into the patient. 

30. A method for the Induction of antibodies which comprises utilizing a 
recombinant Influenza virus according to claims 1 to 13 as an immunogen. 
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FIG. 6 
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f I Inserted mandatory headings. specificaJly: 

I I Corrected an obvious error in the response, specifically: 

— — 

( I Edited identifiers where upper case is used but lower case is required, or vice versa. 

I I Corrected an eni^ in the Number of Sequences field specifically: 

f I A "f^ard Page Break" code was insened by the applicant. All occurrences had to be deleted. 

[ I Deleted ending sto|j.codbn in arrlgjo acid sequences and adjusted the "(AjLength:* field accordingly (error 
" due to a Patentin bug). Sequences corrected: . . 



Other: 



f> *Hxaminer: The^bove oorrections must be communicated to tfie applicant in the first Office 
Action. 1)0 NOT send a copy of this form. ^^^^^ 



